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CHAPTER  I.  INTRODUCTION  AND  PRELIMINARIES 


Section  1.  Examples  of  Eigenvalue  Problems 


In"  this  section  present yseveral  model  eigenvalue  problems 
arising  in  physics  and  engineering.  Specifically, — we— wi.-i.-l>  discuss 
briefly  some  important  physical  interpretations  of  eigenvalues  and 
eigenfunctions.  Some  of  the  model  problems  we  discuss  here  will 
serve  as  illustrative  examples  in  connection  with  the  approximation 

l 

methods  considered , in  Chapter  III.  We  will  attempt  to  provide  a 
clear  understanding  of  the  fundamental  ideas,  but  will,  not  present , 
a  detailed  treatment.  For  a  more  complete  discussion  of  the  mate¬ 
rial  in  this  section  we^  refer  to  Courant-Hilbert  [1953]. 

A .  One  Dimensional  Problems 

The  Longitudinal  Vibration  of  an  Elastic  Bar 


We  are  interested  in  studying  the  small,  longitudinal  vibra¬ 
tions  of  a  longitudinally  loaded,  elastically  supported,  elastic 
bar  with  masses  attached  to  its  ends.  The  bar  is  shown  in  Figure 


1.1. 


ELASTIC  SUPPORT 


ELASTIC  SUPPORT 


/  \  \  /  ^  ^ 
END  MASS  LOAD  ELASTIC  SUPPORT 


END  MASS 


Figure  1.1.  Elastic  Bar. 


i 

jjj  We  now  derive  the  governing  differential  equation  and  boundary 

conditions  for  the  problem.  First  we  consider  the  static  prob¬ 
lem  .  Suppose 

f(x),  0  <  x  <  €,  represents  the  external  longitudinal  load,  with 
positive  f(x)  denoting  a  force  directed  to  the  right, 
u(x),  0  <  x  <  *■,  denotes  the  displacement  of  the  cross-section  of 
the  bar  originally  at  x,  with  positive  u(x)  denoting  the 
displacement  to  the  right,  so  that  the  position  of  a  point 
originally  at  x  is  x  +  u(x) , 

c  (x)  ,  0  <  x  <  denotes  the  strain  in  the  x-direction ,  i.e.,  the 
relative  change  in  the  length  of  the  fibers  in  the  bar  (e(x) 
will  be  positive  if  it  describes  extension) , 
a(x),  0  <  x  <  €,  denotes  the  normal  stress  in  the  cross-section 

at  x,  i.e.,  the  force  per  unit  area  exerted  by  the  portion 
of  the  bar  to  the  right  of  x  on  the  portion  to  the  left  of 


(a(x)  will  be  positive  if  it  describes  tension), 

A(x) ,  0  <  x  <  1,  denotes  the  area  of  the  cross-section  at  x, 

E(x)  ,  0  <  x  <  denotes  the  modulus  of  elasticity  of  the  bar  at 
x, 

F(x),  0  <  x  <  denotes  the  internal  force  acting  on  the  cross- 
section  at  x,  i.e.,  the  force  exerted  by  the  portion  of 
the  bar  to  the  right  of  x  on  the  portion  to  the  left,  with 
positive  F(x)  denoting  a  force  directed  to  the  right, 

,°(x),  0  <  x  <  denotes  the  load  due  to  the  ( continuous )  elastic 
support ,  which  is  assumed  to  be  of  the  form 


P (x)  =  — c(x)u(x) , 


where  c(x)  >0  is  the  spring  constant  of  the  support  (the 


negative  sign  indicates  that  the  force  is  directed  opposite 
to  the  displacement),  and 


m(x) ,  0  <  x  <  l,  denotes  the  specific  mass  at  x, 
per  unit  volume  at  x. 

The  strain  c(x)  and  the  displacement  u(x) 

. ,  .  du ,  . 

Mx)  =  ^(x). 


i . e . ,  the  mass 


are  related  by 


This  relation  is  valid  for  small  displacements,  i.e.,  when  !Mx)j 
<'  1.  The  relation  between  stress  and  strain  is  described  by  the 
constituitive  law  of  the  material.  We  are  assuming  the  linear 
relation  given  by  Hooke 1 s  Law: 

o (X)  =  E(x)£ (x) . 

Thus,  since  F(x)  =  a(x)A(x),  we  have 

F(x)  =»  A ( x )  E ( x ) £  (x) 

-  A(x)E(x)^(x)  . 

Now  the  equilibrium  condition  for  the  bar  is 

^|(x)  +  f { x )  +  p(x)  =  0, 

which,  with  the  use  of  the  relations  discussed  above,  can  also  be 
written  as 

( 1  •  1 )  "Hx(A(x)E(x)3^(x)  )  +  c<x)u(x>  =  f(x)>  0  <  x  < 

This  is  the  governing  differential  equation. 

We  consider  the  three  most  important  types  of  boundary  condi¬ 
tions  . 


V 


Dirichlet  Type 

(1.2a) 


u{  0 )  =  a  ,  u(')  =  a 


Here  the  displacements  of  the  end  points  of  the  bar  are  given. 


Neumann  Type 

(1.2b)  -F(0)  =  -(AE^H)(0)  =  b1#  F  (■£)  =  (AEg£)K)  =  b2 

Here  the  forces  at  the  ends  of  the  bar  are  given.  The  different 
signs  at  0  and  f  are  used  to  express  the  outer  normal  deriva¬ 
tive  at  the  ends  of  the  bar. 

Newton  Type 

(1.2c)  “(AEgH)(0)  +  y  ^u(  0 )  =  Cx,  (AEgj)(€)  +  Y  =  Cg, 

where  Y ^,Y 2  >  Q 

Here  y 2  is  the  spring  constant  of  a  spring  attached  to  the  bar  at 
x  =  l  and  -^2u(^)  is  the  force  exerted  on  the  right  end  of  the 
bar  by  the  spring.  We  are  thus  specifying  the  sum  of  the  internal 
force  and  the  spring  force  on  the  right  end  of  the  bar.  The  con¬ 
dition  at  x  =  0  has  a  similar  interpretation. 

(1.1)  together  with  one  of  (1.2a,b,c)  determine  the  displace¬ 
ment  u(x)  in  the  static  case.  We  now  turn  to  the  dynamic  case . 

We  assume  the  external  load  depends  on  the  time  t  and  is 
represented  by  f(x,t)  and  suppose  a^,b^,c^  in  the  boundary  con¬ 
ditions  depend  on  t  :  =  a^t),  b.^  =  b^t),  c^,  =  c^t),  i  =  1,2. 

We  further  suppose  the  bar  is  subject  to  a  damping  force  repre¬ 
sented  by  R.  If  u  =  u(x,t)  is  the  displacement  at  time  t, 
then  from  Newton's  2nd  law  we  have 


a2 

-  m ( x) A( x ) — ^(x, t )  -  R, 
<»t 


(1.3) 

'“X 

-jfa  (A(x)E(x)^(x,t))  +  c  (  x )  u  (  x ,  t ) 


f (X,t) 


0  <  X  <  *  ,  t  >  0 . 


■vw 


Consider  now  equations  (1.3),  with  f  =  R  =  0,  and  one  of 


the  conditions  (1.4a,b,c),  with  a.  =  a  =  b  =  b  =c  =c  =0. 

12  12  12 


If  we  seek  separated  solutions  of  the  form 


u ( X , t )  =  v(x)w(t) , 


in  which  the  spatial  variable  x  and  the  temporal  variable  t 


are  separated,  from  (1.3)  we  find  that 


2 

t-^(  A(x)m(x)^(x)  )  +  c(x)v(x)  ]w(t)  =  -m(x)  A(x)  v(x)  — t ) 

dt 


(1.6) 


-  ^(A(x)E(x)  |^(x))  +  c(x)v(x) 


m(x)A(x) v(x) 


•  ,  0  <  x  <  ■£,  t  >  0 , 


Imposing  the  boundary  conditions  (1.4a,b,c)  on  u  =  vw  we  find 


(1.7a) 


v(0)w(t)  =  0,  v(^)w(t)  -  0 ,  t  >  0 


-(AEg)(0) 


m1v(0) 


dt 

w(  t ) 


(1.7b) 


(ABglKI 


m2v(^ ) 


dt 

w(t) 


,  t  >  0, 


(1.7c) 


■(AE§g)(0)  +  y  x  v(  0  ) 


m  ^  v (0 . 


(A  cE,K)  +  ^2vK) 


dt 

w(t)  ' 


t  >  0, 


It  is  immediate  that  both  sides  of  equation  (1.6)  equal  a 


N  *•  %  aV  '«  N 


constant,  which  we  denote  by  X . 


We  are  thus  led  to  seek  a  number 


\  and  a  function  v(x)  *  0  so  that 


(l.B)  -^(  A(x)  E(x)^(x)  )  +  c(x)v(x)  =  lm(x) A(x) v(x) ,  0  <  x  <  t. 


From  (1.7a,b,c)  we  get  boundary  conditions  for  v: 


(1.9a] 


v(0)  =  v(f)  =  0, 


(Dirichlet  type] 


(1.9b) 


,dv 


-(AE^,  (0)  =  \mlV(0) 


(Neumann  type) 


,dv, 


(AE^)  (t)  =  lm2v(^)  , 


(1.9c' 


,dv , 


-(AE~)(0)  +  /  jV(  0)  =  \  m  :  v  (  0  ) 


,dv, 


(Newton  type) 


(AEJ^) (<)  +  r2v(t)  =  Vm2v<€) , 


The  problem  of  finding  \  and  v(x)  *  0  satisfying  (1.8)  and  a 
boundary  condition  (1.9)  of  Dirichlet,  Neumann,  or  Newton  type  is 
called  an  eigenvalue  problem .  \  is  called  an  eigenvalue  and 

v(x)  a  corresponding  eigenfunction,  or  eigenvector ,  of  the  prob¬ 
lem,  and  (\,v)  is  often  called  an  eigenpalr .  If  V  is  present 
in  one  or  both  of  the  boundary  conditions,  the  problem  is  referred 
to  as  a  Steklov-type  eigenvalue  problem. 

For  the  sake  of  definiteness,  let  us  suppose  we  have  a  Newton 
type  boundary  condition  at  0  and  a  Dirichlet  type  at  f,  and 


further  assume  that  m^  =  0.  Thus  we  are  considering  the  initial¬ 


boundary  value  problem 

(1.3') 

(1.4c') 

(1.4a') 


d  du  <3  ^u  „ 

"ax( AEox)  +  cu  =  ~mA772'  0  <  x  <  t  >  0 

a  t 


,r>  U  , 


-(AE^)  (  0  ,  t  )  +  ^  u  (  0  ,  t )  =  0, 


uK,t)  =  0,  t 


(1.5') 


u(x,0)  =  r  (x) 

^(x,0)  =  r2(x)  ,  0  <  x  <  €. 


The  corresponding  eigenvalue  problem  is 


(1.10) 


-4—  (AE^)  +  cv  =  \mAv,  0  <  x  <  f. 
dx  dx 

■  -(AE^~)  (0)  +  >  x  v  (  0  )  =  0 
V(€)  =  0. 


It  is  knowp  that  problems  of  this  type  have  a  sequence  of 
eigenvalues 
(1.11) 

and  corresponding  eigenfunction 


(1.12) 


v1 (x) .  v2 (x) ,  ...  . 


The  eigenfunctions  satisfy 


(1.13) 


t. 

m(x)A(x)v.  (x)v.(x)dx  = 

0 


where  $^  =  1  if  i  =  j  and  8^=0  i  *■  j ,  i.e.,  they 

are  orthonormal;  in  addition  they  are  complete  in  L2  ,  i.e.,  any 

function  h(x)  e  L>2  can  be  written  as 

00 

(1.14)  h  (  x )  =  Xcjvj(x)' 

j  =  l 


where 

(1.15) 


e. 

c .  =  mAhv  .dx 
3  3 

0 


and  the  convergence  is  in  the  L  -norm.  Regarding  (1.11) 


(1.15) 


Corresponding  to  each  A  we  solve 

2 

(1.165  2”(t)  +  *  .  w  ( t )  =  0,  t  >  0 

dt  J 

(cf.  (1.6)),  obtaining 

w ( t )  =  w  (t)  =  aH  sin  /\~(t+0  ), 

J  J  J  J 


where  a^.  and  0  are  arbitrary.  Thus  the  separated  solutions 
are  given  by 


(1.17) 


a  j V  j  ( x )  sin  A~(t-w?..),  ^  = 


It  is  immediate  that 


<x> 


(1.18) 


u(x.t)  =  ^ajvj<x>  sin  /TT(  t+e.. ) 


3  =  1 


is  a  solution  of  (1.3'),  (1.4c'),  (1.4a')  for  abitrary  a  and 
Qy  provided  the  series  converges  appropriately.  It  remains  to 
satisfy  the  initial  conditions  (1.5').  For  this,  a^  and  ^ 
must  satisfy 

u  ( x ,  0 )  =  ^\a  j  sin  /T7e..v..(x)  =  i^x), 

§^(X,0)  =  cos  jVjtx)  =  r  2  ( x )  . 


From  the  complete  orthonormality  of  the  v^(x)  we  see  that  these 
two  equations  uniquely  determine  a^  and  a  .  Thus  (1.18),  with 
this  choice  for  a^  and  0  is  the  unique  solution  of  (1.3'), 
(1.4c'),  (1.4a'),  (1.5). 


The  simple  motions  given  in  (1.17)  are  called  the  eigenvi- 
brations  of  (1.3),  (1.4c'),  (1.4a).  All  the  points  x  of  the 


j  "  eigenvibrations  vibrate  with  the  same  (circular)  frequency 
(defined  to  be  the  number  of  vibrations  per  2 n  seconds)  and 


phase  displacement  /XT© ^  and  the  point  x  vibrates  with  ampli¬ 
tude  proportional  to  v.(x).  Thus  is  the  frequency  with 

1  J 


which  the  j 


eigenvibration  vibrates  and  v^(x)  gives  the  basi 


shape  of  the  eigenvibration.  The  amplitude  factor  a^  and  9^ 
are  determined  by  the  initial  position  and  velocity  of  the  eigen¬ 
vibration,  whereas  \ ^  and  v^(x)  are  determined  by  the  physical 
process  itself,  as  represented  by  (1.3'),  (1.4c'),  and  (1.4a'). 

We  have  seen  that  any  motion  of  (1.3'),  (1.4c'),  (1.4a')  can  be 

written  as  a  sum  or  superposition  of  eigenvibrations. 

So  far  we  have  been  dealing  with  free  vibrations,  i.e.,  we 
have  assumed  f(x,t)  and  R  in  (1.3)  are  zero.  Now  we  briefly 
consider  the  case  when  f  *  0  and  R  *  0,  i.e.,  the  case  of 

forced  vibrations.  If  we  write 

TO 

f  ( x ,  t )  =  ^fi(t)vi(x)m(x)A(x), 

3=1 

00 

then  we  easily  see  that  u(x,t)  =  y  .a.(t)v.(x)  is  a  solution  if 

J  J  3 
3=1 

a"(t)  +  *  ja j ( t )  =  f j(t)  . 


If,  now,  f.(t)  =  sin  /T7(t+<9.),  then  we  see  that  a.(t),  and 
111  l 

hence  u(x,t),  will  be  unbounded  as  t — .  This  phenomena  is 
called  resonance  and  f  is  called  a  resonant  load;  the  resonant 


frequencies  are 

1  = 

1,2,.. 

• 

The  damping 

term  R 

could 

be  defined  in 

various  ways. 

For 

example,  we  could  take  R 

to  be 

du  c 

/'•rr,  for  a 

constant  /< , 

which 

would  lead  to  a  term  of  the  form  in  equation  (1.3)., 

Eigenvalue  problems  similar  to  (1.8)  and  (1.9)  or  (1.10) 
arise  in  a  number  of  other  situations.  We  now  briefly  mention 
some  of  them. 

The  Transverse  Vibration  of  a  String 

We  are  interested  here  in  the  small,  transverse  vibration  of  a 
homogeneous  string  that  is  stretched  between  two  points  a  distance 
1  apart.  Gravity  is  assumed  to  be  negligible  and  the  particles 
of  the  string  are  assumed  to  move  in  a  plane.  We  denote  the  den¬ 
sity  of  the  string  by  r  and  the  tension  by  p.  We  restrict  our 
attention  to  the  case  of  free  vibrations. 

If  the  particles  of  the  string  are  identified  with  the  num¬ 
bers  0  <  x  <  f-  and  if  u(x,t)  denotes  the  vertical  displacement 
of  the  particle  x  at  time  t,  then  u  satisfies 


(1.19) 


'-pi-u^t).  =  -rijMx^t),  o  <  x  <  ,,  t  >  0 
ax  at 

U ( 0 , t )  =  U(€,t),  t  >  0. 


We  see  that  (1.19)  is  a  very  special  case  of  (1.3)  and  (1.4a). 
The  associated  eigenvalue  problem  is 


(1.20) 


) -C2v" (x)  =  \ v ( x ) ,  0  <  x  <  € 
]v(0)  =  v(€)  =  o. 


where 


p/r.  It  is  easily  seen  that  the  eigenvalues  and 


eigenfunctions  of  (1.20)  can  be  given  explicitly;  they  are 


(1.22)  vk(x)  =  /2/t  sin  — ,  k  =  1,2 . 

The  entire  discussion  of  the  elastic  bar  -  i.e.,  the  discus¬ 
sion  of  separation  of  variables,  of  eigenvalues  and  eigenfunctions, 
and  of  eigenvibrations  —  applies  to  this  problem.  We  note  that  it 
is  possible  to  find  the  eigenvalues  and  eigenfunctions  explicitly 
only  in  very  special  situations,  roughly,  just  in  the  case  of 
eigenvalue  problems  for  differential  equations  with  constant  coef¬ 
ficients  in  one  dimension.  In  general,  one  must  resort  to  approx¬ 
imation  methods.  The  discussion  of  such  methods  is  the  main  topic 
of  this  article. 


Characterization  of  the  Optimal  Constant  in  the  Poincare  Inequality 
The  Poincare  inequality  states  that  there  is  a  constant  C 
such  that 


(1.23) 


l  l 

[  u  ( x )  ]  2dx  <  C  [u'(x)  ]  2dx 
0  0 


for  all  functions  u(x)  having  a  square  integrable  first  deriva¬ 
tive  and  vanishing  at  0  and  f.  Let  us  consider  the  problem  of 
finding  the  minimal  constant  C.  We  are  thus  interested  in 


(1.24) 


Using  the  elementary  methods  of  the  calculus  of  variations  we  find 
that  the  function  u  achieving  the  supremum  in  (1.24)  satisfies 


C 


f 


u'v'dx 


uvdx 


for  all  v  having  square  integrable  first  derivatives  and  vanish 
ing  at  0  and  By  integration  by  parts  we  then  find 


(1.25) 


-u"  =  ±U,  0  <  X  <  l 
u(0)  =  u(€)  =  0. 


Thus  1/C  is  lowest  eigenvalue  of  the  eigenvalue  problem  (1.25), 
and  the  optimal  u  in  (1.24)  (which  achieves  equality  in  (1.21)) 
is  an  associated  eigenfunction. 

B.  Higher  Dimensional  Problems 


The  Vibrating  Membrane 

Consider  the  small,  transverse  vibration  of  a  thin  membrane 
stretched  over  a  bounded  region  Q  in  the  plane  and  fixed  along 
its  edges  T  =  dft .  The  vertical  displacement  u(x,y,t)  of  the 
point  (x,y)  in  ft  at  time  t  satisfies 


(1.26) 


-AU  =  - 


d  2u  _  d2u 
dx2  dy2 


(x,y)  e  Q,  t  >  0 


u( x , y , t )  =  0,  ( x , y )  c  dft,  t  >  0. 


As  with  the  vibrating  elastic  bar  or  the  vibrating  string,  if  we 
seek  separated  solutions  of  the  form  u(x,y,t)  =  v(x,y)w(t),  we 
are  led  to  the  eigenvalue  problem  of  finding  \  and  v(x,y)  *  0 
satisfying 


(1.27) 


-Av  =  \v,  (x,y)  e  ft 
v ( x , y )  =  0,  (x,y)  s  dft. 


and  for  each  eigenpair  (\ ,v)  of  (1.27),  to  the  differential 
equation 


V  »Vh"W  A  '  Vw’f  V.N-  ''A  "w'i  V*  O  « 


’  v“*  O.  ^ 


(1.28) 


■(t)  +  \  w  { t )  =  0,  t  >  0, 


for  w(t)  (cf.  (1,16)). 


It  is  known  that  (1.27)  has  an  infinite  sequence  of  eigen¬ 


values 


and  corresponding  eigenfunctions 


v1(x,y),  v2(x,y) . 


The  eigenfunctions  are  complete  and  orthonormal  in  L2(ft) 


ajVj(x,y)  sin  /TT(t+0^),  j  =  1,2,...,  are  called  eigenvibrations . 


/T^  is  the  frequency  and  v^(x,y)  is  the  shape  of  the  j* 


eigenvibration.  All  solutions  of  (1.26)  can  be  obtained  as  a 


superposition  of  eigenvibrations  (cf.  (1.18)).  We  note  that  if. 


instead  of  fixing  the  membrane  on  T ,  we  allowed  it  to  move  free¬ 


ly  in  the  vertical  direction,  then  we  whould  have  the  Neumann  boun- 


^  VI  ^ 

dary  condition  =  0,  where  5—  denotes  the  outer  normal  deri- 

ct  n  on 


vative,  instead  of  the  Dirichlet  condition  u  =  0.  The  approxima¬ 


tion  of  the  eigenpairs  of  a  membrane  is  discussed  in  Subsection 


10. B.,  11. B.,  12. A.,  and  12.B. 


The  Problem  of  Heat  Conduction 


Consider  the  problem  of  heat  conduction  in  a  body  occupying  a 
region  Q  in  three-dimensional  space.  We  suppose  the  temperature 


distribution  throughout  9  is  known  at  time  zero,  the  temperature 


is  held  at  zero  on  < for  all  time,  and  that  we  want  to  deter¬ 


mine  the  temperature  u(x,y,z,t)  at  the  point  (x,y,z)  ~  U  at 


time  t  >  0.  From  the  fundamental  law  of  heat  conduction  we  know 


m 


s 

to 


V  V  V 


it.lV  iV  at, 


(1.29) 


"9x  P  X,Y'Z  9x  -  5y(P(x*Y'z)5y)  -  ^(P(x.Y.z)^) 

c?  u 

=  -r(x,y,z)^,  ( x ,  y ,  z )  e  Q,  t  >  0 

u(x,y,z,t)  =  0,  ( x, y , z )  e  <9Q  ,  t  >  0 
u(x,y,z,0)  =  f (x,y,z) ,  (x,y,z)  e  Q , 


where 


f(x,y,z)  =  the  temperature  distribution  at  t  =  0, 
p(x,y,z)  =  the  thermal  conductivity  of  the  material  at 
(x,y , z ) , 


r(x,y,z)  =  density  of  the  material  times  the  specific  heat  of 
the  material. 

If  we  seek  separated  solutions 

u(x,y,z,t)  =  v(x,y,z)w( t) 

of  the  differential  equation  and  the  boundary  conditions  in  (1.29) 
we  are  led  to  the  eigenvalue  problem 


(1.30) 


d  .  dv.  d  .  dv.  d  .  dv.  .  ,  . 

<5x  ^*5x  c5y  P9y  dz^^dz  ~  rv' 

v(x,y, z)  =  0,  (x,y, z)  e  dQ  , 


and  for  each  eigenpair  (A.,v)  of  (1.30)  we  are  led  to  the  equation 


(1.31) 


w'  +  \w  =  0,  t>0 


for  w(t)  (cf.  (1.16)  and  (1.28)).  (1.30)  has  eigenvalues 


and  eigenfunctions 


0  <  J  i  <  \  2  s 


vx , v  , . . . 


satisfying 


I 

y 

K 

[ 

I 

f 

i 

i 

* 

t 


viv^.r  dx  dy  dz  =  6^. 


Q 


Corresponding  to  each  *  ^ ,  from  (1.31)  we  find  w(t) 
a^e-*-**.  Thus  the  separated  solutions  are  given  by 

a^  (x,y,z)e-lj  t ,  j  =  1,2,..., 


V1' 


and  the  solution  of  (1.29)  is 


00 


(1.32) 


u(x,y,z,t) 


-I 


j  =  l 


fv^r  dx  dy  dz 


0 


,  .  -X  i  t 

Vj  (x,y  ,z)e 


(cf.  (1.18).  We  note  that  from  (1.32)  and  the  positivity  of  the 
eigenvalues,  one  can  show  that  lim  u(x,y,z,t)  =  0  and  that  the 

t-*OD 

rate  at  which  the  temperature  u  decays  to  zero  is  largely  deter 
mined  by  \ 1 . 

The  Vibration  of  an  Elastic  Solid 

The  vibration  of  an  elastic  solid  Q,  the  three-dimensional 
generalization  of  the  elastic  bar,  is  governed  by  the  Navier-Lame 
equations 


(1.33) 


(X+/J)5x  +  ^Au 

•  (X+p)£y  +  /IAV 


80 

('+/,)az  +  /,Aw 


-x  +  p 


a2u 

at2 


-y  +  p 


a2v 


at' 


-z  +  p- 


a2 

a  w 


at 


2 ' 


(x,y,z)  e  Q ,  t  >  0, 


where  u(x,y,z,t),  v(x,y,z,t),  and  w(x,y,z,t)  are  the  x,y, 
and  z-components  of  the  displacement  of  the  point  (x,y,z)  Q 


du  dv  dw 

at  time  t,  0  =  ^  ^  X,Y,  and  Z  are  the  components  of 

the  external  force  per  unit  volume  acting  at  (x,y,z),  \  >  0,  and 
/i  >  0  are  the  Lame  elastic  constants,  and  p  is  the  density  of 
the  material . 

As  in  the  case  of  the  bar,  boundary  conditions  of  various 
types  may  be  prescribed.  For  example,  the  Dirichlet  boundary  con¬ 
ditions  prescribe  the  values  of  u,v,  and  w  on  r  =  dQ  .  Neumann 
conditions  are  more  complicated.  Let  n  be  the  unit  outer  normal 
to  T,  let  nx,ny'  and  nz  be  the  x,y,  and  z-components  of 
n,  and  let 

a  _  a  a  ^  a 

cTn  nx  cfx  +  ny  oTy  nz  <Tz 
be  the  outer  normal  derivative.  Then  define 


( 1.34a) 


Xn  =  ^nx  +  4?  +  nx  +  U  V  55  nz] 


,,  ,,  dv  .du  dv  flw 

(1.34b)  Yn  -  \0ny  +  fi^n  +  ^dy  nx  +  ^  ny+  nz3 

(1.34c)  Z  =  \dn  +  n  f  7T~  n  +  n  ]• 

n  z  dn  ^'■a  z  x  dz  y  dz  zJ 

The  Neumann  conditions  then  consist  in  prescribing  xn>Yn>  and 

Zn  on  the  boundary.  One  can  also  mix  the  boundary  conditions  in 

various  ways,  e.g.,  impose  Dirichlet  conditions  on  one  part  of  the 

boundary  and  Neumann  conditions  on  the  remainder  of  the  boundary 

or  prescribe  X  ,  Y  ,  and  w  on  F . 
n  n 

The  eigenvalue  problem  associated  with  (1.33)  is  given  by 


kd<9 

-<*♦''>55 


-  /'lU  =  Mp  U 


-  jp |V  =  Mpv 


as 


-  n A  w  =  Mpw ,  (  x ,  y  ,  z  ; 


(1.35) 


where  we  have  denoted  the  eigenvalue  parameter  by  <•>  (to  avoid 

confusion  with  the  Lame  constants  /.<  and  *  )  ,  and  where  here 

u,v,w,  and  0  denote  functions  of  x,y,  and  z  only,  i.e.,  the 

separation  of  variables  has  been  written  as  u(x,y,z,t)  =  u(x,y,z) 

T(t),  etc.  For  boundary  conditions  we  can  consider  any  of  those 

mentioned  above.  If  we  consider  Dirichlet  conditions  (u  =  v  =  w 

=  0  on  r )  we  refer  to  the  clamped  solid  and  if  we  consider 

Neumann  conditions  (X  =  Y  =  Z  =0  on  r )  we  refer  to  the 

n  n  n 

free  solid. 

The  approximation  of  the  eigenvalues  of  the  free  L-shaped 
panel  (a  two  dimensional  analogue  of  the  elastic  solid)  is  treated 
in  detail  in  Subsection  10. A. 

The  Steklov  Eigenvalue  Problem 

The  Steklov  eigenvalues  of  the  differential  operator  -A  +  I 
are  those  numbers  A  such  that  for  some  nonzero  u, 

{-A  u  +  u  =  0  in  0 

=  Au  on  T  =  00  . 
dn 

Problems  of  this  type,  in  which  the  eigenvalue  parameter  appears 
in  the  boundary  condition,  arise  in  a  number  of  applications  (cf. 
(1.9b)  and  (1.9c)). 

The  Problem  of  Stability  of  a  Nonlinear  Problem 
Consider  the  quasilinear  parabolic  problem 

r)u  <^U 

^  -  A u  +  =  0,  ( x , y )  e  Q ,  t  >  0 

u  (  x ,  y ,  t )  =  <p  ( x ,  y )  ,  ( x ,  y )  •  dQ  ,  t  >  0  . 


Suppose  u(x,y)  is  a  stationary  solution,  i.e.,  suppose 


If  the  term  N  in  (1.36)  is  neglected,  then  this  result  is  simi¬ 
lar  to  that  mentioned  at  the  end  of  the  discussion  of  heat  conduc 
tion.  Note  that  L  is  a  nonself ad joint  operator  and  its  eigen¬ 
values  will,  in  general,  be  complex  (cf.  Section  3).  For  further 
detail  on  this  type  of  stability  results  see  Prodi  [1962]. 


Section  2.  Sobolev  Spaces 

The  natural  setting  for  a  discussion  of  eigenvalue  problems 
and  their  approximation  is  the  theory  of  linear  operators  on  a 
Hilbert  space.  In  this  section  we  will  sketch  the  definitions  and 
basic  properties  of  the  function  spaces  we  will  make  use  of. 

These  are  mainly  the  Sobolev  and  Besov  spaces. 

Let  Q  be  a  bounded  open  subset  of  Rn  and  denote  by  x  = 

)  a  point  in  Rn.  For  each  integer  m  >  0,  the  real 
^ complex)  Sobolev  space  Hm(Q)  is  defined  by 

(2.1)  Hm  =  Hra(Q)  =  (u  :  aau  €  L.  (0 )  V  |cr|  <  m)  , 

where  L2(Q)  denotes  the  usual  space  of  real  (complex)  valued 
square-integrable  functions  on  Q  equipped  with  the  inner  product 


(u.v)  *  ( u , v ) 


L2(0) 


(2.2) 

and  norm 
(2.3) 

On  Hm(Q)  we  have  the  inner  product 


uvdx 


1  U  =  !'  u  i*  —  i 

L2(Q)  1 


O 


.2.  v1/2 

u  |  dx) 


Q 


(2.4) 

and  norm 

(2.5) 


<(u’v))m  =  l(u’v)lm.Q  -  Loh 


m 


d  uo  v  dx 


0 


!u"  =  it  u  • 

Hm(Q)  m 


U  m,Q  ^|a  |  < 


m 


[aau| 2dx) 1/2 


0 


With  this  inner  product,  Hm(Q)  is  a  Hilbert  space.  Here  <»  = 


■  L'V 


( a ^ , . . . , o ^ ) ,  with  the  a ^  a  nonnegative  integer,  h 

and  a°u  =  a ^ °  I u/ax^1  ,  .  .  .  ,dx° n .  We  also  have  the  semi-inner  pro- 

1  n 


duct 


(2-6)  (U'V)„m,^  =  (U'V)m  =  (u'V)m,0  =  2.1 

n  [U  ) 


a  =m 


3  u3  vdx 


0 


and  serai-norm 
(2.7)  |u 


=  I  u  I  =  I  u  |  _ 

( 0  )  ^ 


^  |  a  |  =m 


,  . 2  ,  .1/2 
|3  u |  dx ) 


It  is  immediate  that  H  (Q)  =  L„(Q)  and  :!uM 


0,0 


=  I  u  | 


0,0 


!!  u!!  _  ,  „  .  .  If  T  =  30  is  Lipschitz  continuous,  then  Cm(0)  is 
L2(Q) 

dense  in  Hm(Q).  (T  is  called  Lipschitz  continuous  if  it  can  be 
locally  represented  by  a  Lipschitz  continuous  function,-  see  Necas 
[1967]  for  further  details.) 

Hq (0 )  is  defined  as  the  closure  in  H1(0)  of  0^(0),  the 
space  of  infinitely  differentiable  functions  on  0  which  vanish 
near  f.  The  Poincare  inequality,  which  states  that 


(2.S)  |u|0,0  '  C|U| 1,0'  V  u  e  H0(0) ' 

i  — 

shows  that  I  •  I  ^  ^  is  a  norm  on  H^(Q).  HQ(Q)  is  c^osure  in 

Hm(0)  of  C*(Q). 

If  T  is  Lipschitz  continuous,  then  we  can  define  the  space 

L 2 ( T )  ,  which  consists  of  functions  u  defined  on  F  for  which 
f  2  1/2 

'U'L  (T  )  =  ^  J  )  <  1X1  <  where  ds  denotes  the  surface  area 

L„ ( r )  is  a  Hilbert  space  with  inner  product  (u,v)  =  [  uvds . 

l2(D  r 

It  is  also  known  that  a  function  u  €  H*  (Q  )  has  a  well-ri?fined 
restriction  to  f,  denoted  by  tr  u,  in  the  sense  of  trace;  u  = 


tr  u  satisfies 


Hq(0)  =  {u  H1(0)  :  u  =  0  on  F  in  the  sense  of  trace}. 

Furthermore,  a  function  u  C1  (0  )  is  in  H^(0)  if  an  only  if 
u  =  0  for  all  xu  r .  We  note  that  if  F  is  Lipschitz  contin¬ 
uous,  then  the  normal  vector  n  is  defined  almost  everywhere  on 


I  .  The  outer  normal  derivative  4—  is  defined  for  u  -  H2(u)  . 

dn 

H2(0)  =  {u  u  H2(Q)  :  u  =  ^  =  0  on  r>. 

We  shall  occasionally  make  use  of  the  vector  valued  Sobolev 
spaces  :-m(Q)  which  are  defined  by 

(2.10)  -m(0)  =  ( ( u : ( x ) . uk(x))  :  Uj(x)  u  Hm(0),  j  =  1 , . . . , k } 

and 


(2.11) 


2 

rm(0  ) 


U 


1  m ,  Q 


2 

uk  m ,0  ' 


In  the  study  of  eigenvalue  problems,  central  use  will  be  made 
of  Rellich's  theorem  (cf.  Agmon  [1965]),  which  states  that  every 

bounded  sequence  in  H  (0 )  has  a  subsequence  which  converges  in 

i  n 

HJ  (0  )  if  j  <  m,  provided  Q  is  a  bounded  open  set  in  R  with 

a  Lipschitz  continuous  boundary. 

So  far  we  have  defined  the  Sobolev  space  H‘n(0)  only  for  m 

an  integer.  We  will  sometimes  use  Hm(0),  for  m  fractional 

and  also  the  Besov  spaces,  so  we  now  turn  to  their  definition. 

using  the  K-method. 

For  u  Hm(0)  and  0  <  t  <  v  set 


(2.12) 


K  (  u  ,  t  )  = 


inf 

m  m+ 1 
v  H  ,  w  H 

v  +  w=u 


{  v  ,  +  t  w 

m  ,  0  m  +  1 


Then  for  m  ^  k  <  m+1  define 


(2.13)  :  u 


Hk(0)  k  k,0 


.  -O 


[t  K  {  t ,  u )  ]  2  ^)1/2 


and 

(2.14) 


u 


0 . 


=  sup  { t  K(u, t ) } , 


H  (Q )  0<t<» 

where  0  =  k-m .  The  space 


(2.15) 


Hk(Q )  5  (u  e  Hm(Q )  :  hum  .  <  *} 

HK(Q  ) 


is  the  Sobolev  space  with  fractional  order  k  and 
(2.16) 


Hk(Q)  s  (u  e  Hm(Q)  :  „U:  ..  <  * } 

H  (0  ) 


is  a  Besov  space,  the  one  often  denoted  by  B, 


2,  x, 

In  order  to  fix  these  ideas  and  to  obtain  a  fact  we  will  use 


in  the  sequel  (cf.  Subsections  10. A.  and  10.B.),  we  now  consider 
the  function 


a 


u  =  r  ,  for  (r,6>)  s  S  =  {  (  r  ,  0  )  :  0  <  r  <  1,  0  •  o  '>  2 n  )  , 


where  -1  <  a  <  0,  (r,0)  being  polar  coordinates,  and  pr'v/e  the. 


u  -=  H°  +  1  (S) 


Theorem  2.1.  For  -1  <  a  <  0,  we  have 


u  =  r  '  H 1  +< '  {  S  )  . 


Proof .  Let  <p{x),  0  <  x  <  *,  be  a  function  having  derivatives  o 
all  orders  and  satisfying 


<p(x)=0  for  0<x<12, 
<p  ( x )  =  1  for  1  <  x  <  '  . 


V  =  [  1  -</>  ( -  )  ]  u  , 


■ 


■  j* 


For  0  <  <■>  -  1,  define 


V 


w  =  <o  (  — )  u  . 

o 


Then  we  obviously  have  u  =  v+w.  Now 


2  .  2a  + 1 

^  r  dr 

H°<»)  °l0 

2a +  2 


2a +  2 


2i  =  [|w|2  +  |2  +  '§£  |2|dxidx2 

H1(S)  J-1-  1  X2  J 

o 


r^or1 


o  J  o 


,  2  ^  ,aw.  2  ^  -2  .aw.  21  ,  ,,, 

w|  +  ijpl  +  r  l^l  r  drd^ 


,2  .aw.21 

W|  +  r  dr 


_r  2<<  +  l,  ^  2 

C[  r  dx  +  a  r  dr 


<  * 

r  -2  f  2a  +  1 

•*•  <  >  r  d  r  J 

X  /  O 


C*  20  , 


with  C  independent  of  a.  Hence 


and  thus 


a  + 1  n 

K  (  u  ,  t )  C  [  <■>  +  to  ] 


WWW 


If  0  <  t  <  1,  let  6  =  t  to  get 


and  hence 


(a  +  1 ) 


K(u, t) 


2C 


sup  {t  (a+1)K(u,t) }  <  2C. 
0<t<l 


If  t>l,  we  obviously  have 


K(u,t)  =  i  u  !  <  C' 

HU{0) 

and  hence 

sup  (t"(a  +  1)K(u,t)  }  <  C' . 
1  <  t  <x 


Therefore 

:  U:  .  =  sup  {t“(1+<‘')K(u,t)  }  s  C"  <  x 

'(S)  0<t<x 

and  hence  u  e  H1+a(S),  as  was  to  be  proved. 

In  a  similar  way,  one  can  also  prove  that  r°  e  H1+a(S)  for 
o  >  0,  not  an  integer.  Finally  we  note  that  r‘  =  H1+U(S),  but 
r  &  H  (S)  for  any  e  >0. 

For  a  complete  discussion  of  the  Sobolev  and  Besov  spaces  we 
refer  to  Adams  [1975],  Necas  [1967],  and  Butzer  and  Barens  [1967] 


Remark  2.1.  The  definition  of  the  Sobolev  spaces  with  fractional 
index  m  has  a  very  simple  interpretation.  For  u  to  be  in 
H*+<,(S)  means  that  for  any  0  <  t  <  x,  u  can  be  split  into  the 
sum  of  a  smooth  function  and  a  nonsmooth  function  in  a  natural 
way,  We  have  employed  this  natural  splitting  in  the  proof  of 
Theorem  2.1  and  we  will  use  it  in  the  sequel. 

So  far  we  have  considered  only  one  special  family  of  Sobolev 


spaces  or  Sobolev-type  spaces. 


Several  other  families  are  impor- 
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•tant  in  various  situations.  For  example,  if  Q  c  r  with  0  ~ 


8Q ,  and  if  0  <  3  <  1  and  m  a  i  *  1 ,  we  can  define 


(2.17) 
,m ,  & 


h;‘‘'~(0)  =  {u  £  h€  1  (0 )  :  (dau)r'?+,a|  1  - 


t  h2(0)  for  / 


a 


m} 


and 


m 


2.18 


u 


H™'*{0 ) 


=  u  e-i  + 
H  (0) 


I 

la ) 


,  .  <  + 1  a  I  2 

{a  u,r  l2(0 


2  2  1/2 

where  r  =  (x^  +  ^ 


Spaces  of  this  kind  are  called  weighted 
Sobolev  spaces.  For  more  details  we  refer  to  Kufner  [1985].  Con¬ 


sider  the  function  u  =  r7  ,  with  0  <  y  <  1  .  One  can  show  that 


Tm  ,1 


u  £  H  .j  '  ( 0 )  ,  where  0  =  {  ( r  ,9 )  :  0  <  r  <  1},  for  1  >  1  -;  ,  m  2 


and  l  =  2 .  In  fact,  since  |dauj  s  C(a)r/ 


we  have 


|dau|r?+*Ct^  2  <  r7  +  i  2,  and  we  see  that  u  £  H^'^(Q)  for  m,t’, 


and  3  as  given. 

We  will  also  have  occasion  to  use  countably  normed  spaces 
constructed  from  Sobolev  spaces.  For  example,  consider  the  space 


(2.19)  ®2(0)  =  (u  £  H2,2(0)  :  :  (d<Ju)r l<J  1 '2+? 


Cd  ' U  *<)  ! 


l2(0 


for 


>  2,  with  C  and  d  independent  of 


It  is  easy  to  see  that  all  functions  u  £  B*( Q)  are  analytic  in 

-? 


0  -  (0).  The  function  r  considered  above  belongs  to  fi“(0) 
for  3  >  1-;  .  We  have  here  only  considered  weights  with  respect 
to  the  origin.  More  generally,  one  can  consider  weights  with  res¬ 
pect  to  the  vertices  of  domains  with  piecewise  smooth  boundaries. 

An  important  reason  for  introducing  these  spaces  is  to  characterise 
the  solution  (eigenfunctions)  of  a  problem  as  precisely  as  possible 
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by  embedding  it  (them)  in  as  small  a  space  as  possible.  There  are 
other  classes  of  function  spaces  that  are  important  in  various 
contexts,  but  we  will  not  go  further  in  this  direction. 

Remark  2.2.  We  have  followed  the  usual  custom  of  using  the  same 
notation  for  real  and  complex  function  spaces.  It  will  be  clear 
from  the  context  which  version  we  are  using.  See  Remark  4.1. 


In  Section  1  the  eigenvalue  problems  were  stated  in  classical 
form,  i.e.,  we  were  seeking  an  eigenvalue  >  and  a  corresponding 
nonzero  eigenfunction  u(x)  such  that  the  eigenvalue  equation  and 
boundary  condition  were  satisfied  in  the  classical  pointwise  sense 
These  problems  can  alternately  be  given  variational  formulations. 
Since  finite  element  approximation  methods  are  most  naturally 
defined  in  terms  of  variational  formulations  we  now  briefly  indi¬ 
cate  how  eigenvalue  problems  can  be  cast  j.n  variational  form.  We 
will  do  this  by  discussing  2nd  order  elliptic  eigenvalue  prob¬ 
lems  in  two  dimensions  in  some  detail.  We  begin  by  describing 
this  type  of  problem. 

Consider  the  problem: 

Seek  a  real  or  complex  number  \  and  a  nonzero  real  or 
complex  valued  function  u(x)  satisfying 


(3.1) 


(  Lu )  ( X )  =  t  (Mu)  (x)  ,  x  0 
( Bu ) ( X )  =  0 ,  x  c  f  -dQ , 


where  0  is  a  bounded,  open,  connected  set  in  R4, ,  and 

2  2 

(3.2)  Lu  ( x )  =  -  )  I )  .  (  a  .  .  ( x ) <>  .  u  )  +  /  b  .  (  x ) d  .  u  +  c  (  x )  u  ,  ( -<  . 

/■  i  .  1  t  J  t  i  i 

i , j  =  l  i  =  1 


u  . 

l 


where  a^fx)  =  a^fx),  b^x),  and  c(x)  are  given  real  or  com 
plex  functions  on  o, 


(3.3) 


Mu ( x )  =  d ( x ) u ( x ) , 


where  d(x)  is  a  given  real  function  which  is  bounded  below  by  a 
positive  constant  on  o,  and 


(3.4) 


(Bu) (x)  =  i  2 

-  /  a  .  .n  .d  .  u , 

Z_  ij  1  l 

i,  j  =  l 


where  n(x)  =  ( , n2 )  is  the  exterior  unit  normal  to  r  =  00  at 
x.  L  is  assumed  to  be  uniformly  strongly  elliptic  in  0,  i.e., 
there  is  a  positive  constant  aQ  such  that 


(3.5)  Re 


X  ai3<x)?il;  j  8  •oX'l-  V  x  * 


0  and 


R2. 


i- j  =  l 


i  =  l 


In  addition,  a^,b^,c,  and  d  are  assumed  to  be  bounded  and 

measurable.  (A  portion  of  the  theory  of  eigenvalue  problems  can 

be  developed  under  the  more  general  hypothesis  that  d(x)  is 

merely  assumed  to  be  a  bounded,  measurable,  complex  function,  but 

we  will  not  pursue  this  direction. ) 

(A,u)  is  called  an  eigenpair  of  the  2nd  order  differential 

operator  L  (relative  to  the  0th  order  differential  operator 

M) .  If  Bu  =  u,  the  boundary  condition  Bu  =  0  is  the  Dirichlet 

2 

a,  .n.O.u  =  -$—  =  the  conormal  deriva- 
i  j  j  i  o  V' 

i  .  1  =  1 

tive  of  u,  then  Bu  =  0  yields  the  Neumann  condition. 

It  is  immediate  that  all  of  the  examples  discussed  in  Sec¬ 
tion  1  —  except  the  Steklov-type  eigenvalue  problems  and  the  prob¬ 
lem  of  the  vibration  of  an  elastic  solid  -  are  of  the  form  (3.1) 
or  its  one  or  higher  dimensional  analogues.  In  any  case,  our  dis¬ 
cussion  of  approximation  methods  will  be  in  terms  of  an  abstract 
framework  that  will  cover  all  the  examples. 


(3.6) 

and 

(3.7) 


2 

V" 


L  v  ( x )  =  -  y  ^i<ai  jv>  ■  ^-i<9i<biv>  +  cv 


i,  j  =  l 


i=l 


— *  =  -  V  a..n.d.v  -  ^.b.n.v 
e»  lj  1  3  £*iii 

V  •'  *  •  4  J  _  .  4 


i  .  j  =  l 


i  =  l 


L  is  called  the  formal  adjoint  of  L.  It  is  an  immediate  conse 
quence  of  the  divergence  theorem  that 

2  2 

r  r 

(3.8) 


Luvdx  = 


Q 


(  £  a  £  b^d^uv  +  cuv)dx  + 

0  i , j=l  “ 


i  =  1 


du-  , 
vd 

di‘ 


uL  vdx  + 


'Q 


du-  . 
■a-vdx  - 
du 


dv 
i — , 

du 


V  V  . 

u — „ds 


for  all  smooth  functions  u  and  v.  Hence  we  have 


(3.9) 


Luv  dx  = 


Q 


uL  vdx 


Q 


if  either  u  =  v  =  0  on  T  or  3^  =  =  0  on  P  . 

du  * 


du 


♦ 

If  a.,  and  c  are  real  and  b.  =0,  then  L  =  L  and  ^ 
ij  l  d 


du 


In  this  case  we  say  L,M,B  or,  more  briefly,  L  is  for¬ 


mally  selfad joint ,  and  we  have 


(3.10) 


d(^L)uv  dx 


0 


du(^L)v  dx 

0 


du  d  v 

if  either  u  =  v  =  0  on  r  or  —  =  =  0  on  r.  All  of  the 

du  du 

examples  treated  in  Section  1  are  formally  selfadjoint  except  the 
operator  arising  in  the  stability  analysis  of  the  nonlinear 
initial-boundary  value  problem. 
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Now  we  turn  to  the  derivation  of  a  variational  formulation 


5: 


» 


s 


y. 

•2 


v 


it 


V 


X* 

/■/ 

*j»  * 


for  (3.1).  Suppose  (*,u(x))  satisfies  (3.1)  in  the  classical 
sense,  i.e.,  the  differential  equation  and  the  boundary  condition 
hold  pointwise,  and  consider  first  the  case  of  the  Dirichlet  boun¬ 
dary  condition:  u(x)  =  0  for  x  e  r.  We  assume  ft  is  a  bounded 
2 

open  set  in  R  with  Lipschitz  continuous  boundary  F.  Then, 


multiplying  (3.1)  by  v, 

integrating  over  0, 

and  using  ( 

:  3 . 3 ) 

and  (3.8)  we  find  that 

(3.14)  A  b(  u ,  v )  =  A 

duvdx 

= 

Luvdx 

• 

0 

■ 

'ft 

r  2 

2 

Y  aijaiuv 

+ 

+  CUV 

dx 

j 

ft 

Li  ,  j  =  l 

i  =  l 

J 

+ 

du  -  , 

aU  vds 

r 

r  2 

2 

-•» 

' 

= 

f 

/  a  .  .d  . ud  v 
L  i)  i  j 

+  }  b .  d  . uv 

+  CUV 

> 

dx 

ft 

Li  ,  j  =  l 

i  =  l 

/ 

=  a ( u ,  v )  ,  for  all  v  C 1  ( 0  ) 

that  vanish  on  F . 

a ( u , v )  and  b(u,v),  as  defined  in  (3.14),  are  bilinear  forms 

(sometimes  referred  to  as  sesquilinear  forms  in  the  complex  case) 

in  u  and  v.  They  are  clearly  defined  for  u,v  -  C1(0)  and,  in 

fact,  a(u,v)  is  defined  for  u,v  t  H1(0)  and  b(u,v)  for 
0 

u,v*  H  (0)  =  L  (Q).  Furthermore,  using  the  fact  that  a..,b.,c, 

2  i  j  i 

and  d  are  bounded,  it  follows  from  Schwarz's  inequality  that  a 
is  bounded  on  H*(ft)  and  b  is  bounded  on  H^(O),  i.e.,  that 
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(3.15) 

(3.16) 


!  a  ( u,  v)  j  s  C  ^  |!  u 11  ^  ^  !l  v !!  ^  Q,  v  u,v  -  H  (O  )  , 


b(u,  v)  |  <  C  2  i'  u'l  Q  Qi'v!!0  Q,  V  u ,  v  e  H  (0  ) 


We  note  one  further  property  of  the  form  a(u,v): 


—  !: uil  ^  V  u  e  H1(0),  provided 


3°  b2 
Re  c<*>  *  2“  +  2T 


,  for  all  x  s  Q  , 


(3.17)  Rea(u,u)  > 


where  b  =  max  |b.(x)|, 
xe(l 


i=l ,  2 


a0,U'l  0  ?  ci'ul'i  0'  V  u  €  hJ(Q),  provided 


bi(x)  =  0,  i  =  1,2,  Re  c(x)  >  0. 


aQ  here  is  the  ellipticity  constant  in  (3.5);  the  result  follows 


directly  from  (3.5). 


Since  the  eigenfunction  u  vanishes  on  f,  u  e  Hg(Q). 


Thus,  using  (3.15),  (3.16),  and  the  fact  that  (v  C  (Q)  :  v  =  0 


on  r }  is  dense  in  H  (Q),  it  follows  from  (3.14)  that  the 
eigenpair  (X,u)  satisfies 


3.18) 


u  e  Hj(Q),  u  *  0 


a ( u , v )  =  \b(u,v),  V  v  e  Hq(0) 


(3.18)  is  called  a  variational  formulation  of  (3.1).  We  have 
shown  that  if  (x,u)  is  an  eigenpair  in  the  classical  sense  then 
it  is  an  eigenpair  in  the  variational  sense.  We  now  show  that  the 


converse  is  true,  provided  r,a^,b^,c,  and  d  are  sufficiently 


smooth . 

Suppose  (',u)  satisfies  (3.18)  and  suppose  in  addition  Q 


is  a  bounded  open  set  with  Lipschitz  continuous  boundary  r  and 
2  - 

that  u  e  c  (ft).  Then  from  the  equation  in  (3.18)  and  from  (3.8) 
we  have 


(3.19) 


ft 


Luvdx-  =  a(u,v)  + 

=  a(u,v) 

=  X.  b  ( u ,  v ) 


du  -  , 
tt—  vds 

dv 


ft 


=  X 


duvdx,  V  v  e  C1(Q)  that  vanishes  on  r 


0 


Since  (v  e  C1(0)  :  v  =  0  on  T)  is  dense  in  L2(ft)  we  see  from 

(3.19)  that 


Lu(x)  =  X. Mu ( x )  ,  xe  0 


2  -  1 

Also,  since  F  is  Lipschitz  continuous  and  u  e  C  (ft)  ^  we 

know  that  u(x)  =  0  for  all  x  e  T.  Thus  we  see  that  (X  ,u)  is 
an  eigenpair  in  the  classical  sense. 

2  - 

We  next  present  conditions  that  guarantee  that  u  e  C  (ft). 
From  (3.18)  we  see  that  u  is  a  weak  solution  of  the  boundary 
value  (source)  problem 

Lu  =  f  in  ft 
u  =  0  on  r  , 


where  f  =  x du .  Using  standard  regularity  results  for  elliptic 

2  - 

equations  we  find  that  u  e  C  (ft)  provided  r.a^.b^.c,  and  d 
are  sufficiently  smooth.  In  the  two-dimensional  case  we  are  dis¬ 
cussing  it  is  sufficient  to  assume 


•  f  is  of  class  C4, 


In  the  general  n-dimensional  case  it  is  sufficient  to  assume 

-  k 

•  T  is  of  class  C  , 

•  a.  .  ,b.  €  Ck_1 (0 ) ,  and 

•  c,d  €  ck  2(0),  where  k  =  [n/2]  +  3. 

For  these  results  we  refer  to  Agmon  [1965,  Theorems  3.9  and  9.6], 

Eigenvalue  problems  on  domains  with  corners  arise  in  many 
applications  but  are  not  covered  by  the  above  results  because  of 
the  requirement  that  T  be  smooth.  Nevertheless,  when  0  has 
corners,  analogous  results  in  a  generalized  form  involving  weighted 
Sobolev  spaces  can  be  proved  for  problems  with  smooth  coefficients 
(see  Grisvard  [1985]  and  Babuska  and  Guo  [1987]).  Furthermore 
these  results  provide  information  on  the  behavior  of  u  near  the 
corners  that  is  important  in  assessing  the  accuracy  of  eigenvalue 
approximations.  This  matter  will  be  taken  up  in  Section  10.  We 
now  briefly  outline  the  extent  to  which  the  eigenpair  (A,u)  of 
(3.18)  satisfies  (3.1)  in  the  classical  sense  in  the  case  in  which 
Q  is  a  polygon  and  L  =  -1  and  d(x)  =  1.  From  regularity 
results  for  elliptic  equations  we  conclude  that  u  --  CV  (0- (vertices 
of  0}).  Thus  we  see  that  Lu(x)  =  AMu(x)  for  all  x  c  0  and 

u(x)  =  0  for  x  c  r  -  (vertices  of  o).  u  fails,  however,  to  be 

2 

an  eigenfunction  in  the  classical  sense  in  that  u  •£  C  at  any 
vertex  of  0  with  interior  angle  larger  than 

Under  the  hypothesis  sketched  above,  the  classical  and  varia¬ 
tional  formulations  of  (3.1)  are  equivalent.  For  the  remainder  of 
this  article,  we  will  take  the  point  of  view  that  our  eigenvalue 
problems  are  given  in  variational  form.  Thus  we  will  consider 


problems  of  the  form  (3.18),  or  problems  that  are  generalizations 

of  the  form  (3.18);  see  Section  8. 

Consider  now  the  case  of  the  Neumann  boundary  condition: 

S  u 

t^-(x)  =  0  for  x  e  r.  Suppose  (A,u)  satisfies  (3.1)  in  the  clas 
sical  sense.  Then,  using  (3.8)  we  find 


A  b  ( u ,  v ) 


=  a  ( u  ,  v )  + 

=  a  (  u ,  v )  , 


du 

dv 


vds 


Jr 

for  all 


v 


and  thus,  using  the  fact  that  C*(ft)  is  dense  in 
that  (A,u)  satisfies 


«-=  C1  (0  )  , 
H1  (0  )  , 


we  see 


(3.20) 


u  e  H1 (0 ) ,  u  *  0 

a(u,v)  =  Ab(u,v)  V  v  e  H1(0). 


(3.20)  is  a  variational  form  for  (3.1)  with  the  Neumann  condition. 

2  - 

Now  suppose  (A,u)  satisfies  (3.20)  and  assume  u  C  (ft).  From 
(3.20)  and  (3.8)  we  obtain 


Luvdx  =  a(u,v)  + 


ft 


(3.21) 


=  A  b  ( u ,  v )  + 


du  -  . 
3—  vdx 

dv 


du  - 
■3—  vds 

dv 


=  X 


duvdx  + 


ft 


du  -  .  w  _  1  ,  r  . 

Vds  ,  V  V  c  C  (ft  ) 


Taking  v  e  C1(ft)  which  vanish  on  r  we  find  that 


Lu  ( x )  =  *  u  (  x )  ,  V  x  r  . 


Thus  (3.21)  reduces  to 


-j—  vds 
di 


0 ,  V  v  e  C  (Q  )  , 


which  implies  that  —  =0  on  F.  Thus  we  have  shown  that  (X  ,u) 
satisfies  (3.1)  in  the  classical  sense.  As  with  the  Dirichlet 
condition,  the  analysis  is  valid  under  appropriate  smoothness 
assumptions  on  r,a^j,b^,c,  and  d.  We  will  not  state  these  in 
detail . 

Note  that  the  Neumann  boundary  condition  is  not  explicitly 
stated  in  (3.20).  It  is,  however,  implicitly  contained  in  (3.20). 
We  refer  to  the  Neumann  condition  as  a  natural  boundary  condition , 
in  contrast  to  the  Dirichlet  condition  which  is  referred  to  as  an 
essential  boundary  condition ,  and  which  is  explicitly  contained  in 
the  variational  formulation  (3.18).  The  fact  that  the  Neumann  con 
dition  is  natural  has  important  implications  for  the  approximation 
of  eigenvalues;  see  Remark  10.5. 

In  summary,  for  (3.1)  we  get  one  of  the  following  forms: 

Problem  1 :  Dirichlet  boundary  condition 

Seek  X  , u  *  0  satisfying 

u  -  hJ(Q) 

a ( u , v )  =  \ b ( u , v ) ,  Vve  H*(Q) 


Problem  2:  Neumann  boundary  condition 
Seek  ',u  *  0  satisfying 


u  e  H1 (0 ) 

a  (  u  ,  v  )  =  X  b  (  u  ,  v  )  ,  V  v  ■-  H 1  ( Q  ) 


We  will  sometimes  refer  to  (X  ,u) 


as  an  eigenpair  of  the  form  a 


relative  to  the  form  b.  Regarding  the  forms  a  and  b  we  assume 
(3.15)  -  (3.17)  hold. 


In  a  similar  way,  many  other  problems  —  including  all  of  the 
examples  discussed  in  Section  1  -  can  be  given  variational  formu¬ 
lations.  This  is  done  for  a  number  of  problems  in  Chapter  III. 

We  mention  in  particular  the  eigenvalue  problems  corresponding  to 
the  vibration  of  a  free  L-shaped  panel  (a  two  dimentional  ana¬ 
logue  of  the  elastic  solid) . 

Finally  we  wish  to  make  one  further  point  regarding  varia¬ 
tional  formulations  of  eigenvalue  problems,  namely,  that  a  given 
eigenvalue  problem  can  often  be  given  a  variety  of  different 
variational  formulations  and  that  some  of  these  may  lead  to  more 
effective  finite  element  methods  than  others.  We  illustrate  the 
possibility  of  various  variational  formulations  by  considering  the 
simple  model  problem 


a  ( u ,  v ) 


(au'v'  +  cuv)dx 


are  bounded  bilinear  forms  on  L2*[H2(0,1)  o  Hq(0,1)].  (3.23)  and 

(3.24)  are  equivalent  in  the  sense  that  (\,u)  is  an  eigenpair  of 
one  if  and  only  if  it  is  an  eigenpair  of  the  other. 

Another  formulation  is  obtained  as  follows.  If  we  let  <t  = 
au' ,  then  (3.22)  can  be  written  as  a  first  order  system  of  equa¬ 
tions  , 

-o'  +  cu  =  \u 

(3.25)  iu'  -  |  =  0 

d 

u(0)  =  u(l)  =  0. 


(3.25) 


(3.26) 


can  then  be  given  the  variational  formulation, 

Seek  *,(o,u)  ,r  L2  (  0 , 1  ) -  H*  (  0 , 1  )  satisfying 

- 

a2  (a  ,u,  t/1 ,  v)  =  \  b2  (a  ,u,  ,  v)  ,  V(>r,v)  -  L  (0, 1  )  ■  hJ(0, 1  )  , 
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where 


1 


and 


a2  (cr  (U.ip.v)  = 


(crv'  +  cuv  +  u> 


-  2i)d x 

a  ' 


b2(CT,u,v>,v) 


1 

uvdx . 

0 


a2  and  b2  are  bounded  bilinear  forms  on  L^H*.  (3.22)  and 

(3.25)  are  equivalent  in  the  sense  that  if  {\ ,u)  is  an  eigenpair 
of  (3.22)  and  a  =  au',  then  (X,{ u,cr))  is  an  eigenpair  of 

(3.25) ,  and  if  (\,(a, u) )  is  an  eigenpair  of  (3.25),  then  (\,u) 
is  one  of  (3.22)  and  a  =  au' .  (3.25)  and  (3.26)  are  called  mixed 
formulations  of  the  eigenvalue  problem  (3.22);  see  Section  11.  We 
can  also  write  (3.22)  in  the  form 


(3.27) 


where 


and 


'(a,u)  €  L2(0.1)*hJ(0,1).  (<t,u)  *  (0,0) 

•  A(c t,v>)  +  B  (  y> ,  u )  =  0,  V  ip  e  L2(0,1) 

r1  r  ^  —  1 

B(a,v)  -  J  cuv  dx  =  j  -A.  uv  dx,  V  v  e  HQ , 


A  (cr  ,ip) 


dx 


B (a  ,  v ) 


1 

av'  dx. 

0 


In  Chapter  III  we  will  consider  further  examples  of  variation 
al  formulations  and  show  how  they  can  be  used  to  define  a  variety 
of  finite  methods. 


In  this  section  we  discuss  the  basic  properties  of  eigenvalue 


problems.  As  in  Section  3  this  discussion  will  be  in  terms 
2nd  order  elliptic  eigenvalue  problems. 

We  thus  consider  the  problem  (3.1)  in  variational  form: 

("Seek  0  *  u  e  H  satisfying 


(4.1) 


a(u, v) 


\b(u,  v) ,  V  v  e  H, 


of 


where  H  =  H *(0)  for  Dirichlet  boundary  conditions  and  H  =  H*(Q) 
for  Neumann  conditions.  The  forms  a(*,*)  and  b(*,*)  are  assumed 
to  satisfy 


(4 

•2) 

1  a(u. 

v)  1 

s  Cl!|u|ll,0!!v»l,0' 

V  U,  V  € 

H, 

(4 

•  3) 

1  b(u. 

v)  | 

■  C2!'U!i0,Q!l  v"0,0  ' 

V  u,v  e 

H, 

and 

(4.4)  Rea(u,u)>a||ul!jQ,VueH, 

where  a  >  0.  Sufficient  conditions  for  (4.2)  -  (4.4)  to  hold 
were  given  in  Section  3;  cf.  (3.15)  -  (3.17). 

For  the  study  of  (4.1)  it  is  useful  to  introduce  the  operator 
T  :  H°(Q)  — ->  H  defined  by 


(4.5) 


Tf  e  H 

a(Tf,v)  =  b(f,v),  V  v  e  H. 


T  is  the  solution  operator  for  the  boundary  value  (source)  problem 


(4.6) 


fLu  =  df  in  0 
[bu  =  0  on  r  , 


i.e.,  u  =  Tf  solves  (4.6).  Thus  T  is  the  inverse  of  the  dif- 
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-r 


ferential  operator  L,  considered  on  functions  that  satisfy  the 


boundary  conditions.  It  follows  immediately  from  (4.2)  -  (4.4) 
and  the  Riesz  representation  theorem  in  the  special  case  in  which 
a(*  ,*)  is  an  inner  product  on  H  or  the  Lax-Milgram  theorem  (Lax 


and  Milgram  [1954])  in  the  general  case,  that  (4.5)  has  a  unique 

0 


solution  Tf  for  each  f  e  H  (ft)  and  that 

C 


(4.7) 


Tf!:l,0  s  o^O.O-  V  f  «  »°«»- 


i.e. ,  T  :  H  (ft) 


H  is  bounded.  In  Section  2  we  noted  that  H 


is  compactly  embedded  in  H  (ft ) ,  provided  f  is  Lipschitz  con¬ 


tinuous  (Rellich's  theorem).  From  this  fact  and  (4.7)  we  see  that 
0.„  .  ..0, 


T  :  H  (ft)  — »  H ~ ( ft )  is  a  compact  operator.  We  can  also  view 
as  an  operator  on  H;  we  will,  in  fact,  mainly  consider  T  on 
H.  Another  application  of  Rellich's  theorem  shows  that  T  :  H 
is  compact. 

It  follows  immediately  from  (4.1)  and  (4.5)  that  (*,u 
an  eigenpair  of  (4.1)  (or  of  L)  if  and  only  if 


is 


Tu  =  — u ,  u  *  0 , 


i.e.,  if  and  only  if  (/<  =  \  *,u)  is  an  eigenpair  of  T.  Through 
this  correspondence,  properties  of  the  eigenvalue  problem  (4.1) 
can  be  derived  from  the  spectral  theory  for  compact  operators.  A 
complete  development  of  this  theory  can  be  found  in  Dunford  and 
Schwartz  [1958,  1963].  We  now  give  a  brief  sketch  of  it;  a  slight 

ly  more  complete  treatment  is  given  in  Section  6.  We  present  this 
theory  under  the  assumption  that  the  space  H  is  complex.  This 
leads  to  the  simplest  general  statement  of  the  theory.  In  the 
special  case  in  which  T  is  selfadjoint,  H  can  be  taken  to  be 


real  or  complex.  We  will  specialize  to  the  selfadjoint  case  later. 
Denote  by  p(T)  the  resolvent  set  of  T,  i.e.,  the  set 

p(T)  =  {z  :  z<£  C  5  the  complex  numbers,  (z-T)  1 
exists  as  a  bounded  operator  on  H} , 

and  by  o( T)  the  spectrum  of  T,  i.e.,  the  set  a ( T )  =  C\p(T). 
o( T)  is  countable  with  no  nonzero  limit  points;  nonzero  numbers 
in  a(T)  are  eigenvalues;  and  if  zero  is  in  a(T),  it  may  or  may 
not  be  an  eigenvalue.  Let  0  0  p  e  ct(T).  The  space  N(p-T)  of 
eigenvectors  corresponding  to  p  is  finite  dimensional;  its 
dimension  is  called  the  (geometric)  multiplicity  of  p. 

Now  suppose  L  is  formally  selfadjoint.  Then  it  follows 
immediately  from  their  definitions  that  a(u,v)  and  b(u,v) 
satisfy 

(4.8a)  a(u,v)  =  a(v,u),  V  u,v  e  H, 

(4.8b)  b(u,v)  =b(v,u),  V  u,v  e  H°(0), 

i.e.,  a  and  b  are  symmetric  (or  Hermitian)  forms.  Thus  from 
(4.2)  -  (4.4)  we  see  that  a(u,v)  is  an  inner  product  on  H  that 
is  equivalent  to  ((u.v))^  In  a  similar  way  we  see  that  b(u,v) 

is  an  inner  product  on  H°(Q)  that  is  equivalent  to  (u,v)Q  Q 
(recall  that  d(x)  is  bounded  above  and  is  bounded  below  by  a 
positive  constant).  It  follows  from  (4.8)  that 

(4.9a)  a(Tu,v)  =a(u,Tv),  V  u,v  e  H, 

(4.9b)  b(Tu,v)  =  b(u,Tv),  V  u,v  e  H°(0), 

i.e.,  T,  considered  as  an  operator  on  H,  is  selfadjoint  with 
respect  to  a(u,v),  and,  considered  as  an  operator  on  H°(o),  is 
selfadjoint  with  respect  to  b(u,v).  (We  have  previously  noted  in 


(3.10)  that  b(4  Lu,v)  =  b(u,4  Lv)  if  u  =  v  =  0  on  F  or  if 
u  a 

^7  =  =0  on  r,  provided  L  is  formally  selfad joint . ) 

From  the  fact  that  T  is  selfadjoint  on  H  it  follows  that 
the  eigenvalues  of  T  are  real  and  the  eigenfunctions  correspond¬ 
ing  to  distinct  eigenvalues  are  orthogonal  with  respect  to  a(u,v) 
We  noted  above  that  T  is  compact  on  H  and  it  follows  from 
(4.5)  that  T  is  positive  definite.  Thus  T  will  have  a  count¬ 
ably  infinite  sequence  of  eigenvalues 

o  s  ...  1  n2  < 

and  associated  eigenfunctions 


selfadjoint,  then  (4.1)  (or  L) 


has  eigenvalues 


(4.12)  0  <  i  s  X  <  .  .  .  ✓"  -wo 

and  corresponding  eigenfunctions 

(4.13)  ux ,u  , . . . 
satisfying 

(4.14)  a(u;.,Uj)  =  lib(ujL,Uj)  =  6  . 

In  the  sequence  eigenvalues  are  repeated  according  to 

their  (geometric)  multiplicity.  The  properties  of  eigenvalues  and 
eigenfunctions  used  in  Section  1  in  the  discussion  of  separation 
of  variables  (cf.  (1.11)  -  (1.15))  all  follow  from  the  properties 
we  have  sketched  here. 

Although  our  discussion  has  been  in  terras  of  2nd  order 
elliptic  problems,  it  it  immediate  that  the  results  hold  for  any 
eigenvalue  problem  in  variational  form  provided  the  bilinear  forms 
are  symmetric  and  satisfy  (4.2)  -  (4.4).  We  will  refer  to  this  as 
the  selfadjoint,  positive  definite  case.  In  Section  8  this,  as 
well  as  a  more  general,  class  of  variational ly  formulated  eigen¬ 
value  problems  is  discussed. 

Remark  4.1.  The  eigenvalues  of  selfadjoint  eigenvalue  problems 
are  real  and  the  eigenfunctions  may  be  taken  to  be  real.  Thus 
these  problems  may  be  formulated  in  terms  of  real  function  spaces. 
Nonselfad joint  eigenvalue  problems,  on  the  other  hand,  may  have 

complex  eigenvalues  and  complex  eigenfunctions,  and  are  formulated 
in  terms  of  complex  spaces. 

We  end  this  section  with  a  discussion  of  the  regularity  of 
the  eigenfunctions  of  the  2nd  order  elliptic  operator  L.  L  is 
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not  assumed  to  be  formally  selfadjoint  here. 


Theorem  4.1.  Suppose  for  k  -  2, 

•  f  =  dQ  is  of  class  , 

•  a  .  .  ,  b  .  --  1  (0  )  ,  and 

i  J  i 

k-  ?  - 

•  c,d  -r  CK  *(Q  )  . 

k 

Then  all  eigenfunctions  of  L  (see  (3.2))  lie  in  H  (0)  and 


u  .  ,  ,, 
3  k,0 


Cik/2 

3 


Uj  0,0  ' 


j  =  1,2,  .  . 


Proof .  This  result  is  a  direct  consequence  of  standard  results  on 
the  regularity  of  solutions  of  elliptic  boundary  value  problems. 

In  particular,  we  refer  to  Agmon  [1965,  Theorem  9.8].  7 


Theorem  4.2.  Suppose 

•  f  is  of  class  C*',  and 

•  a..,b.,c,d-:  CS(Q  )  . 

Then  u^  ^  C*(6)  for  j  =  1,2,...  . 

Proof .  This  result  follows  directly  from  Theorem  4.1. 

Theorem  4.3.  Suppose 

•  r  =  30  is  analytic,  and 

•  a^,b^,c,d  are  analytic  on  Q. 

Then  is  analytic  on  0  for  each  j . 

Proof .  For  a  proof  of  this  result  see  Morrey  [1966,  Section  5.7]. 

In  practice  most  of  the  domains  of  interest  have  piecewise 

Let  us  mention  a  result  for  such  domains. 
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analytic  boundaries. 


Theorem  4.4.  Suppose 


•OcR2, 

•  T  is  piecewise  analytic,  and 

•  a^,b.  ,c,d  are  analytic  on  0  -  (vertices  of  0). 

Then  every  eigenfunction  of  L  is  analytic  in  Q  -  U  (vertices), 

2 

and  moreover,  belongs  to  the  space  'B^(O),  for  properly  chosen  8 

Proof .  This  theorem  follows  from  the  results  in  Babuska  and  Guo 
[1987a].  o 

Remark  4.2.  Assume  that  Lu  =  -iu,  0  is  a  polygon,  and  the  boun¬ 
dary  conditions  are  of  Dirichlet  type.  If  0  is  a  convex  poly- 

2 

gon,  then  the  eigenfunctions  u  e  h  (Q),  and  if  0  is  a  noncon- 

*  ^  1  f[ 

vex  polygon,  then  u  e  H  (0)  n  HQ(Q),  where  k  =  -+l,  with  a 
the  maximal  interior  angle  of  the  vertices  of  Q. 

For  a  comprehensive  treatment  of  regularity  results  for  prob¬ 
lems  on  domains  with  corners,  we  refer  to  Grisvard  [1985]. 


Section  5.  A  Brief  Overview  of  the  Finite  Element  Method  for 


Eigenvalue  Approximation 

In  this  section  we  give  a  brief  overview  of  the  use  of  finite 
element  methods  for  approximating  eigenvalues  and  eigenfunctions 
of  differential  operators.  We  will  restrict  the  discussion  to  a 
simple  model  problem  in  one  dimension  and  its  approximation  by  the 
simplest  type  of  finite  element  method. 

Consider  the  selfadjoint  eigenvalue  problem 


(5.1) 


(Lu)(x)  =  -(a(x)u')'  +  c(x)u  =  ld(x)u,  0  <  x  <  € 
u(0)  =  u( 1 )  =  0, 


where  a  e  C*[0f€],  c,d  e  C°[0,€],  and 

0  <  aQ  <  a(x) ,  0  <  c(x),  and  0  <  dQ  <  d(x)  for  0  <  x  <  £ 

(cf.  (3.1)  -  (3.4)).  As  indicated  in  Section  3,  this  problem  has 
the  variational  characterization 


(5.2) 

where 


and 


u  e  hJ(0,*) 

a ( u , v )  =  \b(u,v),  V  v  e  Hq(0,^), 


a(u, v) 


l 

a(x)u'v'dx 

0 


b  ( u  ,  v ) 


duvdx . 

0 


(5.1)  (or  (5.2))  has  a  sequence  of  eigenvalues 

0  <  *  1  i  \  2  '  .  .  .  S  +<» 
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and  corresponding  eigenfunctions 


satisfying 


\  . 

1 


U1 ,u2 ' ‘ ' 


d(x)u.u  .dx  =  6.  ., 
i  3  ij 


On  [0,*]  consider  an  arbitrary  mesh 


A={0=x0<x1<...<xn=€}1 


where  n  =  n(A  )  is  a  positive  integer,  and  let 


Sh  =  {U  :  U  e  C[0,<],  U( 0 )  =  u(<)  =  0, 

u  is  linear  on  1^,  j  =  l,.,.,n}. 


where  h .  =  x .  -  x .  , 
3  3  3-1 

and  h  =  h ( A )  =  max  h  . . 

j  J 


and  Ij  =  (x^_lfXj)  for  j  = 

is  an  (n-1 ) -dimensional  subspace  of 


Hq(0  ,-f)  .  The  pairs  (X,u)  have  been  characterized  in  (5.2)  as 
eigenpairs  of  the  bilinear  form  a(u,v)  relative  to  the  form 
b(u,v)  over  the  space  H*  (  0  ,  ?■)  >  H*  (  0  ,  (.)  .  We  now  consider  eigen¬ 
pairs  of  a(u,v)  relative  to  b(u,v)  over  the  space  Sh>Sh' 
i.e.,  we  consider  the  eigenvalue  problem, 


(5.3) 


Seek  ,  0  *  u, 
h  h 


Sh  satisfying 


a(uh,v)  =  xhb(uh,v),  V  u  •?  Sh, 


m 


r  *  /•  J 

Vv\ 

A  .V. 
k  ,  «  .  « 


and  then  view  the  eigenpairs  of  (5.3)  as  approximations  to  those  o«V<'. 

*  - 

of  (5.2).  (A.  -u. )  is  called  a  finite  element  (Galerkin)  approxi-  vV 

h  h  s-;v 

mation  to  (>,u).  A  wide  variety  of  finite  element  methods  for 

•  v  *, 

eigenvalue  problems  will  be  introduced  and  analyzed  in  Chapters 

-.j—'-sr 

III.  Here  we  will  outline  the  general  features  of  these  methods 
by  examining  the  method  (5.3)  as  it  applies  to  (5.1). 

. 
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Since  is  finite  dimensional,  (5.3)  is  equivalent  to  a 

generalized  matrix  eigenvalue  problem.  In  fact,  if  o  , . . . , o 

n-  1 

is  a  basis  for  S.  ,  then  it  is  easily  see  that  M  .  ,  u,  =  /  z  .o  . 

h  l  h  h  2  J 

j  =  l 

is  an  eigenpair  of  (5.3)  if  and  only  if 


(5.4) 


Az  =  V,  Bz 
h 

z  *  0  , 


where  z  =  ( , . . . , zn_1 )  and 


A  =  (  A  .  .  )  ,  with  A  .  .  =  a  (0  .  ,<i>  . 

ij  ij  J  i 


B  =  (  B .  . )  ,  with  B..  =  b ( 0  .  , o  . 

ij  ij  J  i 


5.3)  (respectively,  (5.4))  has  eigenvalues 


0  <  \  ,i  A  , 

1  ,  h  2  ,  h 


n- 1  ,  h 


and  corresponding  eigenfunctions 

-»  t 

u,  ,  ,  ...,  u  .  ,  (respectively,  z.  ,  =  (z.  ,  ,  ,...,z.  ,  ,  )  , 

l,h  n-l,h  ^  7  j,h  3,l,h  j,n-l,h 


j  =  1 , . . . , n-1 ) , 


satisfying 


A.  ,  du.  .  u  .  .  dx  =  b  .  .  (respectively,  .  z  .Bz.  ,  =<:>..). 

i  ,  h  i,hj,h  l  j  *  1  i  ,  h  i , h  j , h  l  j 

0 

We  further  note  that  if  we  choose  as  basis  functions  the  usual  hat 
functions  determined  by 


0  .  (  x  .  )  =<■■).., 
i  1  il 


then  A  and  B  are  sparse;  in  fact,  they  are  tridiagonal 
easily  see  that  the  three  nonzero  diagonals  are  given  by 


(5.5a)  A.  .  =  -h.  h.  a(x)dx+h.  h.  .  (x.  ,-x) (x-x. )c(x)dx 

i,i+l  l  i  +  l(  l  l+l  l+l  l 


■J 


x . 

l 


X  . 
1 


(5.5b)  A. .  =  h. 

ii  l 


-2 


X . 
l 


a(x)dx  +  h. 2 

l  +  l 


rXi  +  l 


xi-l 


a (x ) dx 


x . 

l 


+  hT2 

i 


x . 
i 


(x-x._1)  c ( x ) dx 


xl-l 


+  hI  +  l 


xi  +  l 


'Xi+1-X)  c(x>dx' 


Xi-1 


(5.5c)  A .  .  =  -h . 1 . hT 1 

1-1,1  l-li 


'Xi 


a ( x ) dx 


Xi-1 


+  hT1  hT1 

l-l  i 


Xi 


(x-x. ) (x-xA  ) c ( x ) dx , 


xi-l 


5.6a) 


B.  .  ,  =  h. 1h. 1 , 
i,i+l  l  i+l 


Xi  +  1 


(xi+1-x) (x-x. ) d ( x ) dx , 


x . 
1 


(5.6b)  B.  .  =  h. 

i,i  l 


-2 


x . 

l 


x-x.  . ) 2d ( x ) dx+h . 2 , 

i-l  i+l 


Xi-1 


Xi  +  1 


;x  -x)  d ( x ) dx 


x  . 

i 


(5.6c) 


B.  ,  ,  =  hT 1  hT 1 

i-l,i  i-l  l 


xi 


(xi-x) (x-xi_1 ) d ( x ) dx . 


‘i-l 


Now  we  specialize  (5.1)  to  the  vibrating  string  problem  dis¬ 
cussed  in  Section  1,  i.e.,  we  let  a(x)  =  p  =  the  tension  of  the 

string,  c(x)  =  0,  and  d(x)  =  r  =  the  density  of  the  string.  We 
also  suppose  the  mesh  is  uniform,  i.e.,  we  let  x^.  =  i'n  1  ;  we 


=  h .  =  fn 
1 


then  have  h 


It  is  easily  seen  from  (5.5a) 


(5.6c) 


If  the  integrals  defining  the  are  approximated  by  the  trape¬ 

zoid  quadrature  rule,  then  instead  of  the  matrix  B  we  would 
obtain  the  matrix 

(5.9)  B  =  rhl 
and  instead  of  (5.4)  we  would  have 

(5.10)  Az  =  \Bz. 

We  finally  note  that  the  eigenvalues  and  eigenvectors  of 
(5.4)  and  (5.10)  can,  in  this  special  case,  be  explicitly  found. 
The  eigenvalues  of  (5.4)  are  given  by 

(5.11)  \  j  h  =  6h  2(1-  cos  J^)(2+  cos  *pr  l,  j  =  1,2 . n-1 

and  those  of  (5.10)  by 

(5.12)  i  h  =  2h  2(1  -  cos  -^^)  pr  1 ,  j  =  l,2,...,n-l. 

The  unnormalized  eigenvectors  of  both  problems  are  given  by 


From  (5.13),  (5.14),  and  (5.16)  we  see  that,  neglecting  the  nor¬ 

malizing  factors,  the  eigenvector  z,  ,  consists  of  the  values  of 

J  •  ^ 

u . (x)  at  x  —  x.,x_,...,x  , . 

J  12  n-1 

(5.17)  shows  that  the  eigenvalue  error  \  .  is  0(h2). 

J.h  j 

Thus  the  small  eigenvalues  of  (5.3)  (or  of  (5.4))  approximate  the 

eigenvalues  of  (5.2) ,  but  the  larger  ones  do  not  since  v  .  -<  . 

J  >  h  j 

is  small  only  if  j2h  is  small.  If,  for  example,  j  •  n1/2, 

2 

then  j  h  is  of  order  one  and  we  would  not  expect  '  to  be 

i  -h  j 

small.  Thus  only  a  small  percentage  of  the  eigenvalues  of  (5.4) 
are  of  interest.  This  observation  influences  the  selection  of 


numerical  methods  for  the  extraction  of  the  eigenvalues  of  (5.4). 


We  also  note  that  (5.17)  and  (5.18)  show  that  X  .  ,  <  X  .  <  X  . 

J  .  h  j  j  ,  h 

for  h  small.  It  is  known  that  X  .  s  x  .  for  all  h;  cf. 

3  3 

(8.42)  . 


A  Physical  Interpretation  of  the  Finite  Element  Eigenvalue  Problem 


(5.10) 


We  consider  here  the  vibration  of  a  weightless  elastic  string 

loaded  with  several  point  masses.  Suppose  we  have  a  weightless 

elastic  string  of  length  l  loaded  with  n-1  particles  of  mass 

- 1  - 1  - 1 

m  at  distances  ,  2-£n  ,...,( n-1  )£n  from  one  end  and  fixed 

at  both  ends.  Gravity  is  assumed  to  be  negligible  and  the  par¬ 
ticles  are  assumed  to  move  in  a  plane.  We  shall  study  the  small 
free  vibrations  of  this  system  of  n-1  degree  of  freedom. 

Let  p  denote  the  tension  in  the  string  and  let  h  =  <n  1  . 

If  qi(t)  denotes  the  vertical  displacement  of  the  ith  particle 
the  particles  being  numbered  from  the  left  (see  Figure  5.1),  then 
the  equation  of  motion  for  the  ith  particle  is  easily  seen  to  be 


(5.19)  -mq"(t)  = 

where  we  assume  qQ 


-P 


'’i-r2qi+qi+i 


qn  -  o. 


i 


1 , 2 ,  . . . , n-1 , 


Figure  5.1. 


Elastic  String  with  Point  Masses. 


If  we  seek  separated  solutions  of  the  form 


q:(t)  =  z1q(t) 


or,  in  vector  form, 


Vl(t)  =  Zn-lq(t) 
q( t)  =  zq{  t)  , 


in  which  the  (discrete)  spatial  variable  j  and  the  temporal 
variable  t  are  separated,  we  find  that 

-mz.q"(t)  =  -p  ^ - —  q(t) 

or 


z .  , -2z . +z .  , 
l-l  1  l+l 


mz . 

l 


=  ~q ,  l  r-- .  for  all  i  and  t. 

q(  t ) 


Both  members  of  this  equation  must  equal  a  constant,  which  we 
denote  by  X.  We  are  thus  led  to  seek  ( X , z  *  0)  such  that 

|j(-z  j  +  2za  -  2i  +  1)  =  *mz^,  i  =  l,...,n-l, 
i.e.,  to  seek  eigenpairs  (l,z)  of  the  matrix 


2  -1 

-1  2  -1 


( 5 . 20 ; 


ph 


-1 


-1  2  -  1 
-1  2 


relative  to  the  matrix  ml,  and,  for  each  eigenvalue  x  ,  solu¬ 
tions  to  the  differential  equation 


(5.21) 


q" ( t )  +  \q(t)  =  0,  t  >  0. 


The  matrix  (5.20)  is  positive  definite.  Thus  it  has  n-1 


eigenvaiues 


0<\,  .  .  .  <  A  ,  , 

1 , h  2 , h  n-1 , h 


and  corresponding  eigenvectors  z  z  _  ,  ,  which  satisfy 

i  /  n  n  i  f  n 


\  .  ,  mz .  .  z  ,  .  =  6... 

i .  h  i,h  j  <  h  ij 


z.  ,  ,  ...,  z„  .  ,  thus  form  an  orthonormal  basis  (i.e.,  are 

1 ,  h  n- 1 ,  h 


complete)  in  (n-1 ) -dimensional  space.  Corresponding  to  \ 
the  solutions  of  (5.21)  are  given  by 


j.h' 


q ( t )  =  q,.(t)  =  sin  jfh(t+0j) » 

where  a^.  and  0^  are  arbitrary.  Thus  the  separated  solutions 
are  given  by 


(5.22) 


Zj,haj  sin  3  *  1 . "-1- 


As  with  the  vibrating  string,  it  is  easily  seen  that  all  solu¬ 
tion  of  (5.19)  can  be  written  as  the  superposition  of  the  separated 
solutions  (5.22).  These  simple  motions  are  called  the  eigenvibra- 
tions.  The  components  of  the  jth  eigenvibration  all  vibrate  with 
some  circular  frequency  /A.  ^  ~  and  phase  displacement  /X~  , 

and  the  components  are  proportional  to  the  components  of  z ^ 

Thus  is  the  frequency  and  h  the  shape  of  the  jL,‘ 

eigenvibration. 

A  complete  discussion  of  the  vibration  of  a  weightless  elastic 
string  loaded  with  several  point  masses  can  be  found  in  Courant- 
Hilbert  [1953]  and  Synge  and  Griffith  [1959]. 


We  now  draw  a  parallel  with  the  finite  element  problem  (5.10) 


It  follows  immediately  from  (5.7),  (5.9),  and  (5.20)  that  the 

eigenvalue  problem  that  we  obtained,  i.e.,  the  problem  of  finding 
the  eigenpairs  of  the  matrix  in  (5.20)  relative  to  ml,  is  iden¬ 
tical  to  the  eigenvalue  problem  (5.10)  provided  m  =  rh  =  r('n  1 . 

We  have  thus  arrived  at  the  following  physical  interpretation  of 
(5.10):  Consider  the  problem  of  a  vibrating  string  with  density 

r  and  tension  p.  Divide  the  total  mass  ri  of  the  string  into 
n-1  particles  of  mass  m  =  r€n  1 ,  which  are  placed  at  the  points 

Xi . x  .  and  two  particles  of  mass  r-£(2n)-1,  which  are 

placed  at  xQ  and  xn>  Then  the  eigenvalue  problem  corresponding 
to  this  system  is  identical  to  the  problem  (5.10)  arrived  at  by 
approximating  (5.2)  by  the  finite  element  method  (5.3),  and  then 
approximating  the  matrix  B  by  B  via  the  trapezoid  rule.  Thus 
the  finite  element  eigenvalue  problem  (5.10)  is  the  same  as  the 
eigenvalue  problem  that  arises  when  the  mass  of  the  string  is 
"lumped"  as  indicated  above. 

The  matrix  A  in  (5.7)  is  called  the  stiffness  matrix  and  B 
in  (5.8)  is  called  the  mass  matrix .  Because  of  the  physical  anal¬ 
ogy  we  have  noted,  B  is  called  the  lumped  mass  matrix  and,  in 
contrast,  B  is  sometimes  referred  to  as  the  consistent  mass 
matrix . 
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CHAPTER  II.  ABSTRACT  SPECTRAL  APPROXIMATION  RESULTS 

In  this  chapter  we  present  the  abstract  spectral  approximation 
results  we  will  use  in  the  sequel. 

Section  6 .  Survey  of  Spectral  Theory  for  Compact  Operators 

Since  the  differential  operators  we  consider  have  compact 
inverses,  our  approximation  results  will  be  developed  for  the 
class  of  compact  operators.  We  turn  now  to  a  survey  of  the  spec¬ 
tral  theory  of  compact  operators.  A  complete  development  of  this 
theory  can  be  found  in  Dunford  and  Schwartz  [1963,  Section  XI.9]. 

Let  A  :  X  — *  X  be  a  compact  operator  on  a  complex  Banach 
space  X  with  norm  !!•!•=!!*!!.  We  denote  by  p( A)  the  resol- 
vent  set  of  A,  i.e.,  the  set 

p ( A )  =  {z  :  z  e  c  s  the  complex  numbers,  (z-A)  1  exists 
as  a  bounded  operator  on  X} , 

and  by  a (A)  the  spectrum  of  A,  i.e.,  the  set  o (A)  =  C\p( A). 

For  any  z  e  p(A),  R2(A)  =  (z-A)_1  is  the  resolvent  operator. 
a (A)  is  countable  with  no  nonzero  limit  points;  nonzero  numbers 
in  *7  ( A )  are  eigenvalues;  and  if  zero  is  in  o(  A),  it  may  or  may 
not  be  an  eigenvalue. 

Let  p  e  ct(A)  be  nonzero.  There  is  a  smallest  integer  <> , 
called  the  ascent  of  p-A,  such  that  N({p-A)a)  =  N((p-A)a  1), 
where  N  denotes  the  null  space.  N{(p-A)a)  is  finite  dimension¬ 
al  and  m  =  dim  N((p-A)°)  is  called  the  algebraic  multiplicity  of 
p.  The  vectors  in  N((p-A)°)  are  called  generalized  eigenvectors 
of  A  corresponding  to  p.  The  order  of  a  generalized  eigenvec¬ 
tor  u  is  the  smallest  integer  j  such  that  u  £  N  (  ( /'  -  A )  ^  )  .  The 


generalized  eigenvectors  of  order  1,  i.e.,  the  vectors  in 


N(/'-A),  are,  of  course,  the  eigenvectors  of  A  corresponding  to 
i.i  .  The  geometric  multiplicity  of  <t  is  equal  to  dim  N(/./-A)  ,  and 
is  less  than  or  equal  to  the  algebraic  multiplicity.  The  ascent 
of  p-A  is  one  and  the  two  multiplicities  are  equal  if  X  is  a 
Hilbert  space  and  A  is  selfadjoint;  in  this  case  the  eigenvalues 
are  real.  If  p  is  an  eigenvalue  of  A  and  f  is  a  correspond¬ 
ing  eigenvector,  we  will  often  refer  to  (p,f)  as  an  eiqenpair  of 
A. 

Throughout  this  section  we  will  consider  a  compact  operator 
T  :  X  — ►  X  and  a  family  of  compact  operators  Th  :  X  — »  X,  0  .<  h 
<  1,  such  that  T^  — ►  T  in  norm  as  h  ^  0.  Let  p  be  a  non¬ 
zero  eigenvalue  of  T  with  algebraic  multiplicities  m.  Let  r 
be  a  circle  in  the  complex  plane  centered  at  p  which  lies  in 
P  ( T )  and  which  encloses  no  other  points  of  er(T).  The  spectral 
projection  associated  with  T  and  p  is  defined  by 


E  =  E  ( // ) 


1 

2ni 


Rz(T)dz. 

Jr 


E  is  a  projection  onto  the  space  of  generalized  eigenvectors 
associated  with  p  and  T,  i.e.,  R ( E )  =  N((p-T)a),  where  R 
denotes  the  range.  For  h  sufficiently  small,  r  c  p(T^)  and 
the  spectral  projection 


!  f 

Eu  =  E .(//)  =  — -  j  R  ( T  )  dz 
h  h  '  2n i  z  h 

Jr 


exists,  Eh  converges  to  E  in  norm,  and  dim  R(E^(/0)  = 
dim  R(E(/0)  =  m.  E^  is  the  spectral  projection  associated  wijth 
eigenvalues  of  Th  which  lie  in  !'  and  is  a  projec- 


tion  onto  the  direct  sum  of  the  spaces  of  generalized  eigenvectors 
corresponding  to  these  eigenvalues,  i.e., 

R(Eh)  =  X  N(  ^(h)'Th}  lHh)  >  ' 

H  ( h)ea  (  T^)  ,/i  ( h)  inside  r 

where  ascent  °f  /-/(h)-T^.  Thus,  counting  according 

to  algebraic  multiplicities,  there  are  m  eigenvalues  of  in 

F;  we  denote  these  by  /j  .  (h)  ,  .  .  .  ,n  (h).  Furthermore,  if  F'  is 

l  m 

another  circle  centered  at  ft  with  an  arbitrarily  small  radius, 

we  see  that  ^,(h),...,/i  (h)  are  all  inside  of  T'  for  h  suf- 
1  m 

ficiently  small,  i.e.,  lim  ju  .  ( h)  =  it  for  j  =  l,...,m. 

h-*0  J 

R(E)  and  R(E^)  are  invarian"t  subspaces  for  T  and  T^, 

respectively,  and  TE  =  ET  and  T.E.  =  E.  T.  .  {R  {T,  }  :  z  e  r,  h 

h  h  h  h  z  h 

small}  is  bounded. 

If  ft  is  an  eigenvalue  of  T  with  algebraic  multiplicity 
m,  then  ft  is  an  eigenvalue  with  algebraic  multiplicity  m  of 
the  adjoint  operator  T'  on  the  dual  space  X'.  The  ascent  of 
Jit-T  will  be  a .  E'  will  be  the  projection  operator  associated 
with  T'  and  ft ;  likewise  E^  will  be  the  projection  operator 
associated  with  T^  and  ^  (h)  ,  .  .  .  ,pt  (h)  .  If  f  e  x  and  f' 

X' ,  we  denote  the  value  of  the  linear  functional  f'  at  f  by 
<f  ,f'>. 

T'  here  is  the  Banach  adjoint.  If  X  =  H  is  a  Hilbert 

* 

space,  we  would  naturally  work  with  the  Hilbert  adjoint  T  , 

which  acts  on  H.  Then  ft  would  be  an  eigenvalue  of  T  if  and 

-  * 
only  if  //  is  an  eigenvalue  of  T  . 

Given  two  closed  subspaces  M  and  N  of  X,  we  define 


6 (M, N) 


=  sup  dist(x,N)  and  ^  { M ,  N)  =  max(«  (M,N)  ,?>  (N,M)  )  .  ^  ( M ,  N 

xeM 

'!  x!!  =1 

is  called  the  gap  between  M  and  N.  The  gap  provides  a  natural 
way  in  which  to  formulate  results  on  the  approximation  of  gener- 


»* 


w 


gSEMjj 


'AM 


£JX 


In  this  section  we  present  estimates  which  show  how  the 


eigenvalues  and  generalized  eigenvectors  of  T  are  approximated 


by  those  of  T,  .  Estimates  for  this  type  of  approximation  were 


obtained  by  Vainikko  [1964,  1967,  1970],  Bramble  and  Osborn  [1973], 


and  Osborn  [1975];  our  presentation  follows  Osborn  [1975].  We 


refer  also  to  Chatelin  [1973,  1981],  Grigorieff  [1975  a,b,c], 


Chatelin  and  Lemordant  [1978],  Stummel  [1977],  and  to  the  excellent 


and  comprehensive  monograph  of  Chatelin  [  1983].  Let  /.t  be  a  non¬ 


zero  eigenvalue  of  T  with  algebraic  multiplicity  m  and  assume 


the  ascent  of  ft-T  is  a.  Let  /./  ( h /<m  ( h)  be  the  eigen¬ 


values  of  T.  that  converge  to  j« . 


Theorem  7.1.  There  is  a  constant  C  independent  of  h,  such 


that 


(7.1) 


SIR'S), F(Eh)>  <  Ct!  <T-Th)  |R(E) 


for  small  h,  where  (T-T^)|^(£)  denotes  the  restriction  of 


T-T,  to  R(  E )  . 
h 


Proof .  For  f  e  R(E)  with  llfii  =  1  we  have  ilf-E.fii  =  ME-E,  )  f ' 


s  I!  E-E.  !•  .  Thus,  since  E,  converges  to  E  in  norm, 


lim  6 (R(E) ,R(E.  ) )  =  0.  Using  Theorem  6.1,  with  M  =  R(E)  and 
h-»0  n 


N  =  R(E^),  we  thus  have 


6 (R(Eh) ,R(E) )  s  8(R(E) ,R(Eh) ) [1-6 (R(E) ,R(Eh) ]' 


26 (R(E) ,R(Eh> ) 


and  hence 


(7.2 


6 (R(E)  ,R(Eh)  )  ?  26  <R(E)  , R ( Eh )  ) 


Now  for  f  e  R(E)  we  have 


II  f-Ehf  ||  =  ||Ef-Ehf!! 


( 


[Rz(T)-Rz(Th)  ]  fdzl! 

r 


Rz(Th>  {T-Th)Rz(T)fdZl 

r 


and  hence,  recalling  that  R(E)  is  invariant  for  T  and  thus  for 
R2(T), 

(7.3)  >|f-Ehf!l<-~  length(r)  sup||Rz(Th) !!  II  (T-Th)  |R(E) !!  supll  R^  ( T )  |j  >1  f  ,t  . 


As  noted  above,  sup  |j  R  ( T.  )  i!  is  bounded  in  h.  Thus  from  (7.2) 

rz  n 

and  (7.3)  we  have 

6(R(E)  ,R(Eh)  )  <  Cil  (T-Th)  |R(E)il  , 

where 

C  =  i  length(D  sup  i!  R  ( T.)  !l  sup  !<R  (T)!|. 
n  zer  zer 

0<h 

Remark  7.1.  The  proof  of  Theorem  7.1  also  shows  that 

«|E-VIr(E|-  5  C!"T-VIR(E)9' 

Although  each  of  the  eigenvalues  /.<  ^  (  h ),...,  /./  (  h )  is  close  to 
/j  for  small  h,  their  arithmetic  mean  is  generally  a  closer  ap¬ 
proximation  to  (cf.  Bramble  and  Osborn  [1973]).  Thus  we  define 

m 

.  1  X"3lh)- 

j  =  l 


Out  next  theorem  gives  an  estimate  for  //-//(h). 


be  any  basis  for  R(E)  and  let 


Theorem  7.2.  Let  d>  ,...,<*> 

-  1  m 

<z>  1  be  the  dual  basis  in  R  ( E'  )  ,  as  defined  in  the  proof 

i  m  r 

to  follow.  Then  there  is  a  constant  C,  independent  of  h,  such 


that 


m 


(7.4)  \u-u(h)\ 


!]Tl<(T-Th), ».,«■>!«:  (T-Th)|R(E)  (T'-ThMR(r 


5=1 


Proof .  For  small  h,  the  operator  E)-Jr(E)  :  R^E)  — ♦R(E^)  is 


one-to-one  since  i  E-E, 


f  =  Ef-E,  f 
h 


e-e.  ;i  f , 

h 


0  and  E^f  =  0,  f  =  R(E)  implies 
and  Eh  ^  R  (  E  )  ;’'s  onto  since 


dim  R(Eh)  =  dim  R ( E )  =  m.  Thus  <EhlR(E))  1  :  R(Eh)  ~ *  R(E)  ls 
defined;  we  write  E^1  for  ^h^RfE)^  **  For  h  sufficiently 
small  and  f  €  R(E)  with  if  =  1  we  have 


1  -  Ehf:  =  '  Ef  -  ii  Ehfi  s  i.E-Eh 


1/2 


and  hence  ;  E,  f|;  i  1/2  f  .  This  implies 


-1 


We  note  that  E,  E,  ‘ 
h  h 


is  the  identity  on  R(E^) 

.-1. 


identity  on  R  (  E  )  .  Now  we  define  T,  =  E,  T,E,  I  „  ,  „  4 

h  hhh'R(E) 

R  (  E )  . 


is  bounded  in  h. 

is  the 
R(E)  — * 


and  E,  1E, 
h  h 


Using  the  fact  that  R(E,  )  is  invariant  for  T.  we  see  that 

h  h 

(  Th )  =  i/-1  j  ( h ),...,  //  (  h )  }  and  that  the  algebraic  (geometric,  res¬ 
pectively)  multiplicity  of  any  /v ^  ( h )  as  an  eigenvalue  of  T^  is 
equal  to  its  algebraic  (geometric,  respectively)  multiplicity  as 
an  eigenvalue  of  T,  .  Letting  T  =  Tl_,_.,  we  likewise  see  that 
<T(T)  =  {//).  Thus  trace  T  =  m//  and  trace  T^  =  m//(h)  and,  since 
T  and  act  on  the  same  space,  we  can  write 


(7.5) 


)‘~l<  (h) 


1 

m 


trace  (  T-T,  )  . 

h 


Let  0 


.  ,0. 


1  '  '  m 

the  dual  basis  to  o 


be  a  basis  for  R(E)  and  let  0 ^ ,  .  . 


,0  '  t 
m 


1  '  ' 


,0. 


m 


Then  from  (7.5)  we  get 
m 


(7.6) 


/u-/v(h)  =  -  trace  ( T-T.  ; 

m  h 


.  i  y 

m 

j  =  l 


<  (  T-T.  )0 

h  3  j 


Here  each  0 \  is  an  element  of  R(E)',  the  dual  space  of  R ( E ) 


but  we  can  extend  each  0!  to  all  of  X  as  follows.  Since  X 


R(E)  N(E)  ,  any  f  ^  X  can  be  written  as  f  =  g+h  with  g 


R(E)  and  h  e  N(E).  Define  <f )>  =  <g  ,Q \>  .  Clearly  0'.,  sc 


extended,  is  bounded,  i.e.,  0 


X' .  Now  <f,(/;-T')  0  .  >  = 


,a 


<(/J- T)  f 0  vanishes  for  all  f.  This  follows  from  the  obser 


vation  that  it  obviously  vanishes  for  f  •£  R(E)  =  N((/j-T)  )  anc 
it  vanishes  for  f  N(E)  since  N(E)  is  invariant  for  /j- T. 


Thus  we  have  shown  that  0  '  .  .  .  ,0  '  €  R(E'). 

1  m 


-1, 


Using  the  facts  that  ThEh  =  Ev.Tv.  and  E„  E^  is  the  iden- 


h  h 


n  n 


tity  on  R(E),  we  have 


-1, 


<  (  T-T,  )0  .0  '.  >  =  <T0  . -E,  T,  E,  0  .  ,0'.> 
h'jj  jhhhj'j 


(7.7) 


,-1, 


=  <E,  E,  (  T-T,  ) 0  .  ,0  '.> 
h  h  h  j  j 


=  «T-Th)0.,o'.>  +  <(EhEh-I)(T-Th)0j,0j>. 


,-l. 


Let  L,  =  E,  E,  .  L, 
h  h  h  h 


is  the  projection  on  R(E)  along  N(E 


Hence  L 
N  (  E'  )  .  Thus 


h  is  the  projection  on  N ( E^ )  =  R(Eh)  along  R(E) 


(7.8)  <(EhEh-I)  (T-Th)0  =  <(Lh-I)  (T-Th  )••■»  ,  (E'-Eh)0  ■>. 


From  (7.8),  the  boundedness  of  ,  and  Remark  7.1  (applied  to 


T'  and  (T'})  we  have 


(7.9)  !<(EhEh-I)(T-Th  ><*  <»'>!  £  (  sup,1 .1^-11  \) "  (  T-Th  )  I  R  £  } 

j  J  ft 


I  (E'-E'  )  i<t>  . 

h  1  R  (  E  )  j  3 


<  Ch  (T  Th)  |  R(  E  j  •  (T  Th)  Ir(E')  • 


Finally,  (7.6),  (7.7),  and  (7.9)  yield  the  desired  result. 


Remark  7.2.  Our  treatment  of  the  term  < ( E' E,  -I ) ( T-T  )d  .  ,0 '  > , 
-  h  h  h  j  j 


which  differs  from  that  in  Osborn  [1975],  was  suggested  by 


Descloux,  Nassif,  and  Rappaz  [1978b]. 


Remark  7.3.  A  slight  modification  of  the  proof  of  Theorem  7.2 


shows  that  for  any  1  <  I,  j  <  m,  !  <  ( T-T^  )<t>  _  ,<t>  _>  I  is  bounded  by 

j  I 


C5,  ,  where 
h 


8h  *  X 

i  ,  j  =  l 


Noting  that  < ( T-T^ )b ^ ,0 ^ >  is  a  matrix  representation  of  T-T^, 


we  see  that 


(7.10) 


T-T,  :  s  C6,  . 
h  h 


Since  it  is  immediate  that 


|/v-/j(h)|  =  ^|  trace(T-Th)  |  <  T-Th  , 


from  (7.10)  we  get 


(7.11) 


I  H  (  h )  I  <  C6  h  , 


an  estimate  that  is  similar  to,  and  of  equal  use  in  applications 


as,  (7.4). 


We  also  have 


1  *  - 1  * -1 
=  -I  traced  -T.  ) 

m  h 


Hence  we  see  that 

V>  .(h)  1 

(7.12)  \n - - 1  a  CS  . 

m  h 

It  is  also  known  that 

(7.13)  \  fj-fj  (h)  |a  <  C  T-Th 
for  any  1  <  j  s  m .  Hence 

(7.14)  l/n-/ij(h)  |a  s  C6h. 

(7.14)  is  established  directly  in  Theorem  7.3.  We  note,  however, 
that  the  proof  of  Theorem  7.3  is  closely  related  to  one  of  the 
ways  of  proving  (7.13). 

Remark  7.4.  It  follows  immediately  from  (7.4)  that 

'"-"j"111  !  c  <t-VIR|E)  • 


However,  the  second  term  on  the  right  side  of  (7.4)  is  of  higher 


order  than  (T-T,  ) 


namely  of  order 


h' 1 R{ E)  ' 

(T-Th)|RjEj  ;  (  T'  -T^  M  r ( £ ) !  •  We  will  also  see  that  in  a  large 


m 


variety  of  applications,  ^  c  (  T-Th  )o  j  ,0  >  is  of  higher  order 

j  =  l 

than  <T-Th)|R(E)  . 

In  addition  to  estimating  //-//(h)  we  may  estimate  .//-//.  (h) 


for  each  j  . 


Theorem  7.3.  Let  11  be  the  ascent  of  <i- T.  Let  <p  be 

-  1  m 

any  basis  for  R(E)  and  let  be  the  dual  basis.  Then 

l  m 

there  is  a  constant  C  such  that 


m 

7.15)  |  //-//  (  h )  !  =  C  {  X  !  <  (  T  ) 0  i  < 19  j  >  ■ 

i  -  j  =  l 


+  (T-ThMR(E)  ^t'-t')|R(e, 


l/a 


,  j  =  1  ,  .  .  .  ,  m  . 


Proof.  For  each  h,  /j  .  ( h)  is  one  of  the  eigenvalues  of  T,  .  Lei 
-  j  h 

T,w.  =  ij.{  h)w.  ,  :  w.  1  =1.  We  can  choose  w'  e  N((/v-T')a)  in  such 

hh  j  h  h  h 

a  way  that  <wh'wh>  =  1  an<^  t*’ie  norms  w' :  are  bounded  in  h. 

First,  using  the  Hahn-Banach  Theorem,  choose  €  R(E)'  such 

that  <w.  ,w'>  =  1  and  1  w' v  =  1;  then  extend  w'  to  all  of  X 
h  h  h  h 

as  in  the  proof  of  Theorem  7.2.  w^,  so  extended  will  be  in 

R(E')  and  satisfy  iw^i  s  '  E  *  Now,  noting  that  (T'-/J)aw^  =  0, 
we  have 


(7.16) 

I  l‘~ll  j  <  h)  I*"*  =  I  <(/<-/./ ^  (h)  )Uwh,w^>( 

=  I  <(  (//-/<j(h)  )a-(//-T)uwh,w^>  | 
a  - 1 

=  I  ( //  ~fJ  j  (  h )  )  j  (/J-T)<'_1“:i  (/i  .  (h)-T)wh,w^>| 

j=0 

o-l 

/  \/J~/J  ^  (h)  [  J  1  <  (n  i  (h)  ~T)wh,  (/i-T  )<,~1~-iw^>  1 
j=0 

o-l 

<_  y  |  //-//.( h )  (  ^  max  \<(fj  (h)-T)w  ,o'>(  /i-T  "  1_"’  w, 

3  0  r  (  e  1  ) ,  <y  ■  =  i  J  n  r 


i  —  n 


For  any  0'  e  R(E')  with  10 


=  1, 


(7.17) 

|  <(//j(h)-T)wh.0'>|  =  |  <(Th-T)wh,0'>( 

■  l<EhlEh(Th-T|wh''‘'>l 

=  I  <(T-Th)wh,<I>'>  +  <(E“1Eh-I)  (T-Th)wh,«'>  ! 

S  l«VT>V*'>l  +  C!  (T-Th>lR(E)  r  (T'-ThMR(r)  • 

There  is  obviously  a  constant  C'  such  that 

m 


(7.18) 


|  <(Th-T)wh,0'>  ( 


s  C' 


X  KIV11*]'*?1 

i  -  j  =  l 


for  all 
(7.16)  - 


e  R(E)  and  0'  e  R(E')  with 
(7.18)  we  get  the  desired  result. 


11 V 


■10' 


1  .  From 


Theorem  7.1  shows  how  the  generalized  eigenvectors  of  T  are 
approximated  by  those  of  T^.  Our  next  result  concerns  the  proxi¬ 
mity  of  certain  elements  of  R(Eh>  to  certain  elements  of  R(E). 
It  shows,  for  example,  that  eigenvectors  of  T^  are  close  to 
eigenvectors  of  T. 


Theorem  7.4.  Let  /./(h)  be  an  eigenvalue  of  T^  such  that 

lim  /j(h)  =  ij  .  Suppose  for  each  h  that  w,  is  a  unit  vector 
h-0  n 

1c 

satisfying  (//(h)-T^)  w^  =  0  for  some  positive  integer  k  u  . 

Then,  for  any  integer  /  with  k  •  f  s  a ,  there  is  a  vector  u,_ 

h 

such  that  (/j-T)^uh  =  0  and 

(■7.19)  «h-„h:  C.  (T-Th)|R(E)  (,*k+1,/". 

f 

Proof  .  Since  N((//-T)  )  is  finite  dimensional,  there  is  a  closed 

f 

subspace  M  of  X  such  that  X  =  N((/z-T)  )  -  M.  For  y 


»4< 


l  ( 

R (  ( — T)  ),  the  equation  (»-T)  x  =  y  is  uniquely  solvable  in  M. 

r .  t 

Thus  ( //  —  T )  I  :  M  — »  R((//-T)  )  is  one-to-one  and  onto.  Hence 
:  R((/;-T)^)  — >  M  exists  and,  by  the  closed  graph 
theorem,  is  bounded.  Thus  there  is  a  constant  C  such  that 
<  C  ;  (/i- T)*f  :  for  all  f  c  M . 

t 

Set  u^  =  Pwh,  where  P  is  the  projection  on  N((/j-T)  ) 

Z 

along  M.  Then  {(J- T)  u,  =  0  and  w,  -u.  e  M,  and  hence 

n  n  n 


(7.20) 


ii wh-uh''  5  Cii  (/u-T)  (wh-uh)  i: 


By  Theorem  7.1  there  are  vectors  c  R(E)  such  that 

lwh“V  s  C''!  (T_Th>  Ip, pm  I  • 


h '  1  R  (  E  )  ;  * 


Hence  there  is  a  constant  C"  such  that 
i:[(/v-T)€  -  (A/-Th)€]wh;l 


(7.21) 


=  (^-Th)j<T-Th)  (A'-T)€*j  :[(wh-uh)  +  uh] 


<  C":  (T-T 


h  1  R  (  E ) 


Since  k  s  l, 


wh 


(7.22 


Y  <  j)  (/^(h)  )  j(Mh)-Th)f  jwh 


Y  <  j  )U‘~V{h)  )j(^(h)-Th)/  j< 
j=-e-k+i 


C'"  \  ij-/i  (h)  | 


Combining  (7.20)  -  (7.22)  we  get 


f-k+ 1 


C  (//- T)  (  w,  —  u,  ) 


% 


<  C  ( H  -T )  wh: 


=  C-  [U*-T)*-{ti-Th)e']vih  +  (^-Th)^wh 


s  C[C"i:  (T-Th)  |R(E)li  +  C"'\u-(j{h) \l  k+1  ] 


The  result  now  follows  immediately  from  Theorem  7.3. 


Remark  7.5.  If  X  =  H  is  a  Hilbert  space,  we  let  T  and  T. 


denote  the  Hilbert  adjoints  of  T  and  T^»  respectively.  In 


Theorems  7.2  and  7.3  we  would  let  <Z> ,  ,  .  .  .  ,0  be  an  orthonormal 

l  m 


basis  for  R(E)  and  let  0  ^  =  E  0  y  Then  0  ^  ,  .  .  .  ,0  ^  ■€  N(  (/”- T  )°  ) 


and  trace  (T-Th)  =  ^ ( ( T-Th)0  .  ,0  *  )  ,  where  (•,*)  =  ( •  , * ) H 

3=1 

denotes  the  inner  product  on  H,  and  with  only  minor  modifica¬ 


tions  all  the  results  of  this  section  remain  valid. 


We  end  this  section  by  specializing  the  results  in  Theorems 


7.1  -  7.4  to  the  case  where  X  =  H  is  a  Hilbert  space  and  T  and 


T,  are  selfadjoint.  If  is  a  nonzero  eigenvalue  of  T,  then, 


as  noted  above,  the  ascent  a  of  fJ-T  is  one  and  the  algebraic 


and  geometric  multiplicities  of  ij  are  equal.  Likewise  the 


eigenvalues  /i^(h)  of  T^  which  converge  to  //  have  equal  alge¬ 


braic  and  geometric  multiplicities.  tt  and  ^^(h)  are,  of 


course,  real. 


Thus,  under  the  present  hypotheses.  Theorems  7.2  and  7.3  give 


the  estimate 


H-V  j(h)  t  •-  C  {  ^ 


((T-Th)0.,o*)|  +  (T-Th)|R(E)  2),  j  =  1 . m. 


i  .  3  =  1 


Now  consider  Theorem  7.4  in  the  selfadjoint  case.  Suppose 


:>i  :v:v  :v  :>££■ >: 


/7(h)  is  an  eigenvalue  of  T.  converging  to  // .  If  w,  is  a 

h  h 

unit  eigenvector  of  corresponding  to  //(h),  then  it  follows 

immediately  from  Theorem  7.1  and  the  definition  of  A ( R( E ) , R ( E. ) ) 
that  there  is  an  eigenvector  u^  of  T  corresponding  to  n  such 
that 


uh‘wh 


C  (  T-T 


h  1  R  (  E  )  ' 


This  is  Theorem  7.4  in  the  case  a  =  1.  We  further  note  that  one 
may  assume  !l  u^  =  1 •  From  Theorem  7.1  we  can  also  conclude  that 
if  u  is  a  unit  eigenvector  of  T  corresponding  to  //  then 
there  is  a  unit  eigenvector  wh  c  R(Eh)  of  Th  such  that 

u~wh ■  =  C! <T-Th) |R(E)  . 

Compare  the  discussion  of  the  Ritz  method  near  the  end  of  Section 


Remark  7.6.  In  the  selfadjoint  case  one  may  assume  the  Hilbert 

space  H  is  real  (cf.  Remark  4.1).  Starting  with  a  real  space  H 

we  can  in  the  usual  way  obtain  a  complex  space  by  complexifying. 

Then  the  contour  integrals  [  R  (T)dz  and  R  (T.)dz, 

2niJr  z  2/JiJj.  z  h 

which  are  the  fundamental  tools  in  the  analysis,  can  be  introduced 
and  the  results  derived.  The  results  will  be  in  the  complex 
context  but  can  immediately  be  translated  to  the  real  context. 

Remark  7.7.  Results  for  noncompact  operators  T  which  parallel 
those  in  this  section  were  proved  by  Descloux,  Nassif,  and  Rappaz 
(1978a, b].  See  Subsection  13. D. 


Section  8 .  Spectral  Approximation  of  Variationally  Formulated 


Eigenvalue  Problems 

As  explained  in  Section  3,  eigenvalue  problems  can  be  given 

j  variational  formulations.  For  the  most  part,  we  will  consider 

eigenvalue  problems  formulated  in  this  manner.  In  this  section  we 
I 

!  will  first  sketch  the  functional  analysis  framework  for  variation- 

i  ally  formulated  eigenvalue  problems  and  then  discuss  their  approx- 

I 

imation.  Results  of  the  type  presented  in  this  section  specifi¬ 
cally  Theorems  8.1  and  8.3,  were  proved  by  Babuska  and  Aziz  [1973, 

i 

Chapter  12]  and  Fix  [1973]  for  the  case  of  an  eigenvalue  with  mul- 
!  tiplicity  one;  in  the  general  case  they  were  proved  by  Kolata 

[1978].  Our  treatment  is  similar  to  Kolata's. 

< 

Let  H1  and  H 2  be  complex  Hilbert  spaces  with  inner  pro- 
j  ducts  and  norms  (  •  ,  •  )  A  and  ^  and  (*,*)2  and  *  1 2  ’  res~ 

pectively.  Let  a(*,*)  be  a  bilinear  (or  sesqui 1 inear )  form  on 

3  H1  H2  satisfying 

j  (8.1)  !a(u,v)  |  «-  C1!u'-1:v:2,  V  u  H1(  V  v  «  H2, 

(8.2)  inf  sup  |a(u,v)|  =  a  >  0, 

■  u^  H  ve  H 

v  1  Z 

•  u;l  =1  V  '  =1 

1  2 

3  and 

» 

(8.3)  sup | a ( u , v) (  >  0,  Vve  H_  with  v  *  0. 

>  veH, 

t  1 


The  Riesz  representation  theorem  and  (8.1)  imply  that  there 
is  a  bounded  linear  map  A  from  H1  to  H 2  such  that  a(u,v)  = 

( Au  <  v )  2  for  all  u  e  v  H  .  The  adjoint  A'  is  a  bounded 

map  from  to  satisfying  a(u,v)  =  (u,A'v)  for  all  u 

H  ,  v  H2.  (8.1),  (8.2),  and  (8.3)  imply  that  A  is  an  isomor- 


phism  of  onto  H2  .  In  fact,  in  the  presense  of  (8.1),  (8.2) 

and  (8.3)  hold  if  and  only  if  A  is  an  isomorphism,  cf.  Babuska 
[1971]  and  Babuska  and  Aziz  [1973,  Chapter  5].  Using  the  fact  that 
A  is  an  isomorphism  if  and  only  if  A'  is  an  isomorphism  we  see 
that  in  the  presence  of  (8.1),  (8.2)  and  (8.3)  hold  if  and  only  if 

(8.4)  inf  sup  |a(u,v)|  =  o  >  0 

ve  H  ue  H 

u.1=1 

and 

(8.5)  sup  |a(u,v)|  >  0,  V  u  e  H„  with  u  *  0 . 
veH2 

(8.2)  and  (8.3)  (or  (8.4)  and  (8.5))  are  called  the  inf-sup  con¬ 
ditions  . 

Suppose  !!  •  |i  j  is  a  second  norm  on  H1  which  is  compact  with 

respect  to  !i  •  •!  ^ ,  i.e.,  every  sequence  in  H1  which  is  bounded 

in  !>  •  1  has  a  subsequence  which  is  Cauchy  in  •  ^ .  Let  b(u,v) 
be  a  bilinear  form  on  H  >h  satisfying 

(8.6)  |  b(u,  v)  |  <  C2:;u"  j!(  v!;  2  ,  Vue  ,  v  £  H2  . 

We  remark  that  in  many  applications  the  form  b(u,v)  is  defined 
on  w1‘w2<  where 

H  r  wl'  w^^h  a  compact  imbedding, 

H2  ■-  w2 ,  with  a  bounded  imbedding, 

and  satisfies 

(8.7)  !  b(u,  v)  |  ?  C  u '  ;v!l  ,  Vue  w,  ,  u  e  . 

2  wl  w2  1  2 

If  • '  =  •  then  it  is  immediate  that  •  '  is  compact 

■*  w  ^  1 

with  respect  to  •  ■  and  that  (8.6)  holds. 


It  is  shown  in  Babuska  [1971]  and  Babuska  and  Aziz  [1973, 
Chapter  5]  that  (8.1)  -  (8.3)  imply  there  are  unique  bounded 


operators 

T  :  H,  Hx 

and  T* 

: 

H2  -  H2 

satisfying 

(8.8) 

a(Tu,v)  = 

b(u, v) , 

V 

U  e  Hx,  V 

v  e  H2, 

a(u,T*v)  = 

b(u,v) , 

V 

u  <£  H1 ,  V 

v  *  H2. 

Furthermore 

C2 

(8.9)  i  Tui!  <  jpiiui!'  ,  Vue  H  . 

If  u^  is  a  bounded  sequence  in  H1 ,  then,  since  '  is 

compact  with  respect  to  il  •  ||  ,  we  know  here  is  a  subsequence  u  . 

1  . 

that  is  Cauchy  in  !!•!:'.  It  then  follows  immediately  from  (8.10), 

applied  to  u.  -u.  ,  that  Tu .  is  Cauchy,  and  hence  convergent, 

Jk 

in  Hj.  Thus  T  :  H1  — »  H1  is  compact.  It  is  immediate  that 

* 

a(Tu,v)  =  a(u,T#v) .  The  operator  T,  is  related  to  T  ,  the 

*  _  i 

usual  adjoint  of  T  on  H1 ,  by  the  transformation  T  =  A'T*  A' 

* 

T  and  T*  are  compact. 

A  complex  number  \  is  called  an  eigenvalue  of  the  form  a 
relative  to  the  form  b  if  there  is  a  nonzero  vector  u  Hl  , 
called  an  associated  eigenvector ,  satisfying 

(8.10)  a(u,v)  =  \b(u,v),  V  v  e  H2  . 

It  is  easily  seen  from  (8.8)  that  (*,u)  satisfies  (8.10)  if  and 
only  if  *Tu  =  u,  i.e.,  if  and  only  if  O  1,u)  is  an  eigenpair 
of  the  compact  operator  T.  (8.10)  is  referred  to  as  a  variation- 
ally  posed  eigenvalue  problem  (cf.  (3.18)).  The  notions  of  ascent, 
generalized  eigenvector,  and  algebraic  and  geometric  multiplicities 


are  defined  in  terms  of  T.  The  generalized  eigenvectors  of  T 


corresponding  to  X  can,  however,  be  characterized  in  terms  of  the 
forms  a(*,*)  and  b(*,*)-  u^  is  a  generalized  eigenvector  of 
order  j  >  1  if  and  only  if  a(u^,v)  =  \b(u^,v)  +  \a(u^  1,v)  for 

all  v  e  H2 ,  where  u"1  1  is  a  generalized  eigenvector  of  order 

-1  *  * 
j-1.  Since  T*  =  A  T  A',  it  is  immediate  that  a(T,)  =  <i  ( T  ) 

and  that  N((\  ^-T*)^)  =  A'  1{N((X  1-T  )  ^  )  }  .  From  this  we  see  that 
the  generalized  eigenvectors  of  T*  have  a  similar  characteriza¬ 
tion  to  those  of  T,  namely,  a(u,v^)  =  Xb(u,v^)  +  'a(u,v^  1)  for 
all  u  e  H1 .  In  particular,  [X  *,v)  is  an  eigenpair  of  T*  if 

and  only  if  a{u,v)  =  ib(u,v)  for  all  u  e  H  ,  i,e.,  (*,v)  is 

an  adjoint  eigenpair  of  (8.10). 

In  order  to  construct  approximations  to  the  eigenvalues  and 
eigenvectors  of  (8.10)  we  select  finite  dimensional  subspaces 

S1  h  c  Hl  and  S2  h  C  H2'  indexed  by  a  parameter  h,  that  sat¬ 
isfy 


(8.11) 


inf  sup  |a(u,v)!  2  '?  =  6  (h)  >  0 

UeSl,h  V"H2,h 
u  X=1  v ■ 2=1 


(8.12)  sup  |a(u,v)|  >  0,  for  each  v  e  S.  ,  with  v  *  0. 

u<-:S  .  2'h 

1  ,  h 

We  also  assume 

(8.13)  for  every  u  e  H  ,  lim  '?(h)  1  inf  u-t;  =  0. 

1  h-*0  L  1 

1  ,  h 

We  note  that  if  dim  h  =  dim  S2  then  (8.12)  follows  from 

(8.11).  We  assume  dim  S.  .  =  dim  S„  ,  for  the  remainder  of  this 

1 ,  h  2  ,  h 

article.  S  .  and  S0  .  are  referred  to  as  test  and  trial 
1  ,  h  2  ,  h 


spaces,  respectively,  and,  if  they  consist  of  piecewise  polynomial 


functions  they  are  called  finite  element  (approximation)  spaces. 


(8.11)  is  referred  to  as  the  discrete  inf-sup  condition. 

We  then  consider  eigenpairs  of  the  form  a  relative  to  the 


form  b, 

but 

now  restricted  to  S,  , 

1  /  ii  2  ,  h 

i.e. ,  pairs  < A h - uh 

where  \. 

h 

is 

a  number  and  0  *  u,  e  S,  ,  , 

h  1 ,  h 

satisfying 

(8.14) 

a ( uh ' v  >  =  *hb(uh,v),  V  v  e 

S  2  ,  h ' 

and  use  and  uh  as  approximations  to  *  and  u,  respec¬ 

tively.  (8.14)  is  called  a  variational  approximation  method  or 
Galerkin  method  for  (8.10)  in  general  and,  if  S  and  S 

1  t  XI  Z  f  i  1 

consist  of  piecewise  polynomial  functions;  it  is  called  a  finite 

element  method .  Since  N  =  dim  S1  h  =  dim  S2  h  c  ®,  (8.14)  is 

equivalent  to  a  matrix  eigenvalue  problem.  In  fact,  if  0  , . . . ,0 

and  1 are  bases  for  S1  h  and  S2  respectively,  the 

N 

(^h,uh  =  ^zjdj)  is  an  ei9enPair  of  (8.14)  if  and  only  if 
j  =  l 

(8.15)  Az  =  \  .  Bz, 

h 

where  z  =  (z1 . zN)T, 

A  =  ( A± j )  ,  Ai j  =  a(0^,vi) 

and 


B 


(B, 


11 


B 


il 


b  ( 0  j  ,  ip  i  )  . 


(*h,uh^  is  an  eigenpair  of  (8.14)  if  and  only  if  ('h1,uh^ 
is  an  eigenpair  of  the  compact  operator  — *  S1  h  defined 

by 


(8.16)  a ( T  u , v )  =  b ( u , v ) ,  V  u  ^  H  ,  v  S0 

h  1  2 ,  h 


The  operator  T  can  be  written  as  P  T,  where  P  is  the  pro¬ 


jection  of  H  onto  S 


-3 


defined  by 


(8.17) 


a(Phu,v)  =  a  ( u ,  v ) ,  v  u  £  H1 ,  v  e  S2  h> 


Using  the  central  result  in  Babuska  [1971]  and  Babuska  and  Aziz 
[1973,  Chapter  6],  it  follows  from  (8.1)  -  (8.3),  (8.11),  and  (8.17 

that 


llu-phu!i  5  (1  +  rrhi’  if  lu-' V 

*  1  .h 


Thus  from  (8.13)  we  see  that  P, 


I  pointwise.  Since  T  is 


compact ,  =  P^T  — >T  in  norm  on  . 

Let  X  be  an  eigenvalue  of  (8.10)  with  algebraic  multipli¬ 
city  m,  by  which  we  mean  that  X  1  is  an  eigenvalue  of  T  with 
algebraic  multiplicity  m.  Let  a  =  ascent  of  >  1-T.  Since  Th 
— >  T  in  norm,  m  eigenvalues  X ^ ( h) , . . . , X  ( h)  of  (8.14)  will 
converge  to  X.  The  X^(h)  are  counted  according  to  the  alge¬ 
braic  multiplicities  of  the  /j  j(h)  =  Xj(h)  1  as  eigenvalues  of 

T,  .  Let 
h 

(8.18)  M  =  M(X)  =  (u  :  u  a  generalized  eigenvector  of  (8.10) 

corresponding  to  X,  :!u!^  =  1), 

*  * 

(8.19)  M  =  M  (X)  =  (v  :  v  a  generalized  adjoint  eigenvector  of 

(8.10)  corresponding  to  ',  v  2  = 


(8.20)  =  M^(X )  =  (u  :  u  in  the  direct  sum  of  the  generalized 

eigenspace  of  (8.14)  corresponding  to  the 
eigenvalues  X^(h)  that  converge  to 
'  .  u  2  =  1  )  , 


and  define 
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(8.21) 


F ,  =  F(x)  =  sup  inf  uM 

ft  ft  __ .  -  /  _  1 

utM(»  )  f^S.  , 

1  ,  h 


and 

(8.22 


<h  = 


sup 

V-IM*(>  )  !?€  S 


inf  v-i? 


2 ,  h 


Let  M(V)  =  R ( E )  and  Mh ( \ )  =  R ( Eh ) . 

We  now  state  and  prove  four  results  which  correspond  to 
Theorems  7.1  -  7.4.  Let  a  denote  the  ascent  of  *  1-T. 

Theorem  8.1.  There  is  a  constant  C  such  that 


(8.23) 


6  (M(\ )  ,Mh(\  )  )  <  C/?(  h)  ch, 


Theorem  8.2.  There  is  a  constant  C  such  that 

m 


(8.24 


|X_(m  XXj(h>  1]  ^  ?  C8{h)  l£hc 
j  =  l 


Theorem  8.3.  There  is  a  constant  C  such  that 


(8.25) 


-j(h)  I  <  C(Mh)  1Khc  *h)1/a 


Theorem  8.4.  Let  Mh)  be  an  eigenvalue  of  (8.14)  such  that 
lim  Mh)  =  V.  Suppose  for  each  h  that  w,  is  a  unit  vector 


h-*0 


h 


- 1  k 

satisfying  (Mh)  -T)  w,  =  0  for  some  positive  integer  k  i  o 


Then,  for  any  integer  ?.  with  k  <  f  <  a,  there  is  a  vector 
such  that  (M1-T)^u,  =  0  and 


uv 


8.26) 


u  -w  h 
h  h  1 


CCMhfMM**1'"' 


Proofs .  The  eigenvalues  and  generalized  eigenvectors  of  (8.10) 
and  (8.14)  have  been  characterized  in  terms  of  the  compact  opera- 
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tors  T  and  and  we  know  that  Th 


T  in  norm.  Thus  we  can 


apply  the  results  in  Section  7,  with  X  =  and  T  and  as 

defined  in  (8.8)  and  (8.16),  to  estimate  the  eigenvalue  and  eigen¬ 


vector  errors.  Note  that  M  =  R(E)  ~  (unit  sphere  in  H1 


where 


E  is  the  spectral  projection  associated  wiht  T  and  \  ,  and 

Mh(M  =  R(E^)  '  (unit  sphere  in  H^)  where  E^  is  the  spectral 
projection  associated  with  T^  and  \.*(h),  j  =  Consi¬ 

der  first  the  proofs  of  Theorems  7.1  and  7.4.  These  results  will 
follow  immediately  from  Theorems  7.1  and  7.4,  respectively,  if  we 


show  that 


(8.27) 


'<T*V Ini*)*  s  ®(h>"V 


From  Babuska  [1971]  and  Babuska  and  Aziz  [1973,  Chapter  6]  and 
(8.1)  -  (8.3),  (8.6),  (8.8),  (8.11),  and  (8.16)  we  have 


<T-VUli  J  <*  +  TThT’  i?f  'Tu->'  I' 

1  l.h 

Since  M  =  R(E)  is  invariant  for  T,  for  u  a  R(E)  we  obtain 


inf  Tu-t  '  i  r  T  u  . 

“si.h  1  h 


(8.27)  follows  from  these  two  estimates. 

Now  consider  the  proofs  of  Theorems  8.2  and  8.3.  The  right 
hand  side  of  (8.4)  is  bounded  by 


^Wd’:1  +  (T-Th|lR(E)  <T‘-ThHR(E‘>  ■ 


i .  J-l 


w«  now  show  that  this  quantity  can  be  bounded  by  C i ( h ) 


h  h ' 


u  with  u  =  1  and  for  v  c  R ( E  )  with  v  =  1 


h  ^  v  e 


ktftAI, 


it 


( (T-Th)u, v  ) 2  =  a( (T-Th)u, A'  v  ) 

=  a  (  (T-Th)u,  A'_1v*-7? 


C1  (T"Th)U  1  A'  V  ^2'  V  11  "=  S2,h‘ 


We  have  here  used  the  definition  of  the  operator  A,  (8.8),  a 
(8.16).  Recalling  the  A'-1  maps  R  ( E*  )  =  N ( A  ~  ^ -T * ) * 1 )  onto 
N(  A  ~  ^-T*  )a  )  =  M*  (A  )  ,  we  get 

(  (T-Th)u,v*)1  <  Cji:  (T-Th)u,i  1  A'_1  £*. 


From  this  it  is  immediate  that 


*  *  * 


*  *  * 


(8.28) 


(T  -Th)v  ;  =  sup  I ( u , ( T  -Th)v  ) 

U-:H1 

u  1  =  1 


C,  ii  T-T.  :  A'_1  v:  *  V  v*-:R(E*  )  with  v*  , 
inn  1 


(  (T-Th)»  i  j  )  ]  -  ((T-Th)«rE 


8 . 29 


C1  E  IT-Th'lR,EI  *■  h 


Cl|l,l"‘  h'h- 


Now,  using  (8.27)  -  (8.29)  we  get 


i  ,  j  =  l 


((T  Th)oj'°j)i[  +  (T  Th  j  ■ R ( E )  lT  Th  1  ^  R ( E  * ) 


0  ••  i  h  :  ' 


Thus  Theorem  8.2  follows  from  Theorem  7.2  and  Theorem  R.3  from 
Theorem  7.3. 

Remark  8.1.  The  proof  we  have  given  f nr  (8.24'.  together  with 
(8.12) ,  shows  tha r 


(8.30) 


I  x-\  (h)  |  ^  C/?(h)£h£h, 

where 

m 

(8.31)  X(h)  =  i  ^  X  ^  (  h)  . 

j  =  l 

We  end  this  section  by  specializing  the  results  to  the  Ritz 
method  for  selfadjoint,  positive  definite  problems  and  then  pre¬ 
senting  a  lower  bound  for  the  eigenvalue  error.  Suppose 

-  H,  !!•!!„  =  N  •  (I  is  a  real  Hilbert  space.  Let  a{*,*)  be  a 
H 

symmetric  bilinear  form  on  H  satisfying  (8.1)  and 

(8.32)  a(u,u)  >  a!!u!!^,  Vu  e  H, 

with  o  a  positive  constant.  Note  that  (8.1)  and  (8.32)  imply 

1/2  1/2 
that  a(u,u)  and  (•  u(!  are  equivalent  norms;  a(u,u)  is 

often  called  the  energy  norm  of  u.  Let  b  be  a  symmetric  bilin¬ 
ear  form  on  W  satisfying  (8.7),  with  =  W  o  H  compactly, 

and  satisfying 

(8.33)  b(u,u)  >  0,  V  nonzero  u  e  H. 

(8.32)  implies  (8.2)  and  (8.3)  are  satisfied.  We  note  that  (8.1) 

and  (8.32)  show  that  a(*,*)  is  equivalent  to  the  given  inner 

product  ( • , • )  =  ( • , • ) H  on  H .  We  wi 1 1  now  take  a ( • , • )  to  be 

the  inner  product  on  H  and  take  !!  •  !i  =  /a  ( •  ,  •  )  to  be  the  norm. 

* 

We  see  that  T  =  T«  =  T.  Thus  T  is  selfadjoint;  it  is  also  pos¬ 
itive  definite.  It  is,  of  course,  compact.  Let  S  =  S  .  =  S, 

1  ,  h  2  ,  h  h 

H  be  a  family  of  finite  dimensional  spaces  satisfying  (8.13). 

In  this  case  the  variational  approximation  method  (8.14)  is  called 
the  Ritz  method.  (8.11),  with  Mh)  =  <»  ,  and  (8.12)  follow  from 

(8.32)  . 


Under  these  hypotheses,  the  problem  (8.10)  has  a  countable 
sequence  of  eigenvalues 


0  <  .  »2  „  V3  s  ...  /■  « 


and  corresponding  eigenvectors 


V  u2'  u3 


which  can  be  assumed  to  satisfy 


(8.34 


a  ( u  .  ,  u  . )  =  A.  .  b  ( u  .  ,  u  . )  =  a.. 6.. 
i  J  3  1  y  J  i] 


(cf.  Section  4).  In  the  sequence  { A.  ^  }  ,  the  are  repeated 

according  to  geometric  multiplicity.  Furthermore,  the  A  .  can  be 
characterized  as  various  extrema  of  the  Rayleigh  quotient 

R(u)  = 

b  ( u ,  u ) 

We  state  these  characterizations  now. 


Minimum  Principle 


(8.35) 


min  R(u)  =  R(u  ) , 

u-:H 


min  R ( u )  =  R  ( uk )  ,  k 
u-:  H 


=  2,3, 


a(u,u. )=0, i=l , . . . , k- 1 


Minimum-Maximum  Principle 


(8.36)  A.  =  min  max  R(u)  = 

VH  ucvk 

dim  V.  =k 
k 


Maximum-Minimum  Principle 


max  R ( u ) ,  k  =  1,2, 


o  uk  sp(u1 . uk; 
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. zk_!  H 


min  R ( u ) 
u  H 

a(u,zJ)=0,i=l,....k-l 


m 


min  R(u) , 
u-:H 

a(u,u . ) =0 , i=l , . . . , k~l 


k  =  1,2, 


Likewise  (8.14)  (with  S,  ,  =  S.  .  =  S.  )  has  a  finite 

1 , n  2  ,  n  n 


sequence  of  eigenvalues 


0  <  *l,h  S  *2.h  '  5  Vh-  "  *  dl”  V 


and  corresponding  eigenvectors 


U1 ,h*  ‘ *  * ' UN , h ' 


which  can  be  taken  to  satisfy 


{8.38)  a(u  ,u  )  =  \  ,b  ( u .  ,  ,u.  .  )  =  A.  ,6  . 

l  /  n  j  /  h  j  /  n  i  ,  h  j  /  n  j  /  n  i  /  j 

For  the  A,  .  we  also  have  extremal  characterizations 
k ,  h 


Minimum  Principle 


(8.39) 


min  R(u)  =  R  (u_  ,), 

-  c  1 ,  n 

U^  b, 

n 


min  R(u)  =  R  { u,  ,  )  ,  k 

u-rS.  K,n 

h 


.  .  ,  N 


a  ( u ,  u  .  )=0,i  =  l . k-1 

i  ,  h 


Minimum-Maximum  Principle 


(8.40)  A, 


V.  .--S, 
k ,  h  h 

dim  V,  ,  =k 
k ,  h 


Maximum-Minimum  Principle 


max  R(u ; 
u-'V,. 


max  R ( u ) , 
U'rUk,h-sp(ui,h'"‘ 


k  =  1,2 


8.41)  * 


Z1  ,  h . 2k-l  ,  h  H 


min  R ( u ) 

U  Sh 

a  (  u  ,  z  .  ^)=0,i  =  l . k-1 


(8.46) 


U,  -u.  .  , 

k  k ,  h  1 


h' 


To  be  more  precise,  if  u  ,u2,...  satisfy  (8.34),  then 

u  .  ,u„  u„  .  can  be  chosen  so  that  (8.38)  and  (8.46)  hold. 

1 ,  h  2  ,  h  N  ,  h 

Regarding  these  applications  of  Theorems  8.1  -  8.4,  see  the  dis¬ 
cussion  of  the  selfadjoint  case  at  the  end  of  Section  7. 

2 

We  have  shown  that  |A-A(h)|  s  Cc h ,  where  A(h)  is  any 
eigenvalue  that  converges  to  A  .  We  now  show  that  if  A  is 
simple,  then  the  error  has  the  lower  bound 


2 

| A -A  ( h ) |  >  Ct ,  ,  C  a  positive  constant. 

These  results,  together,  imply  that  the  eigenvalue  error  is  of  the 
same  order  as 

Theorem  8.5.  (Kolata  [1978]).  Suppose  we  are  in  the  selfadjoint, 
positive  definite  case  discussed  above  and  suppose  '  is  a  simple 
eigenvalue  of  (8.10),  i.e.,  it  has  (geometric)  multiplicity  one, 

A (h)  is  defined  by  the  Ritz  method,  and  A  (h)  ^  ' .  Then  there 

is  a  positive  constant  C  such  that 


(8.47)  | A  ( h ) -  A  |  O 

for  small  h. 


Proof.  Let  o  be  a  unit  eigenvector  of  (8.10).  Combining  (7.6) 
and  (7.7)  yields 


(8.48) 


-  1 


'  (  h) 


=  a(  (T- 


T.  )o 
h 


+  a 


(  T-T,  )n  ,  L  e  -o  ) 
h  h 


a  (  (  T-T.  )o  , o  ) 
h 


I  a  (  (  T-T. 

h 


)<•  ,  L 


where  L 


is  'he  projection  introduced  following  i7.7) 


Now 


using  (8.8),  (8.16),  and  the  definition  of  ,  we  have 


(8.49) 


a((T-Th)0,«M  =  A  a(  (T-Th)<p  ,T0  ) 


=  A  a(  (T-Th)o  ,T<P-Tho  ) 


-  Ai  <T-Thla:a 


- 1  2 
=  V  <t>-\  T.  O  \ 

h  a 


- 1  2 
>  v  inf  j; 0 

--sh 


-1  2 
=  '  *h 


and,  using  (8.27)  and  (8.28),  we  have 


a(  (T-T.  )0  ,L,  0-0  )  |  <  i:  (T-T.  )0: ■  :  L,  0-0  a 
n  n  n  a  n  a 


(8.50) 


s  CMT-Th)0:a  (T-Th)o  a 


C!T'Th  S’ 


Combining  (8.48)  -  (8.50)  we  obtain 


-1  -1  -12  2 
A  -  Mhl  ->  l  cl  -  C  T-Th  ,h 


=  T"Th  1 


■  >  ^ A  *  /  2  ,  for  h  smal 1 . 
h 


From  this  we  obtain 


Mh)-'  '  [h)>  h  2 


(a  ' 2 )  ■  ,  ,  for  h  small 
h 


Remark  8.2.  If  '  is  a  multiple  eigenvalue,  one  can  prove 

2 


! ' - ' ( h) ! 


C 


inf  inf  «.*>-( 


M(  a  )  i  S. 


C f .  Theorem  9  .  1 


Section  9.  Refined  Estimates  for,  Selfadjoint  Problems 

In  the  previous  section  we  presented  error  estimates  for 
variationally  formulated  problems  and  at  the  end  of  the  section  we 
specialized  these  results  to  the  Ritz  method  for  selfadjoint,  posi 
tive  definite  problems.  Because  of  the  importance  of  this  case  in 
applications  we  now  present  an  alternative  analysis  due  to  Babuska 
and  Osborn  [1987].  This  analysis,  which  is  somewhat  more  direct 
and  self-contained  than  that  in  Section  8,  yields  stronger  results 
than  those  in  Section  8  in  the  case  of  multiple  eigeneigenvalues . 

As  at  the  end  of  Section  8,  we  assume  that  a(*  ,* )  is  a  sym¬ 
metric  bilinear  form  on  H  satisfying  (8.1)  and  (8.32),  that  . 
b(*,*)  is  a  symmetric  bilinear  form  on  W  satisfying  (8.7), 
with  W1  =  W2  =  W  o  H  compactly,  and  (8.33).  We  take  a(*,*) 


and  i1  •  !;  =  /a  ( •  ,  •  )  to  be  the  inner  product  and  norm  on  H  and 

a 


set  !i  •  I:  £  =  /b (  •  ,  •  )  .  Then,  as  stated  in  Section  8,  the  eigenval 
problem  (8.10),  i.e.,  the  problem 


ue 


(9.1) 


J 


fu  e  H ,  u  *  0 


a(u,v)  =  b(u,v),  V  v  s  H 


has  a  countable  sequence  of  eigenvalues 


0  <  K 


x  2  '  •  •  •  +"'L 


and  corresponding  eigenvectors 


U1  ,U2 . 


which  can  be  chosen  to  satisfy 


9.2) 


a  (  u  .  ,  u  .  )  =  '  .  b  (  u  .  ,  u  .  )  =  '  . 

i  j  J  1  J  J  1 J 


Furthermore,  any  u  ■  H  can  be  written  as 


(9.3) 


u 


with 


>  a  .u  . 

Z-  J  ] 

j  =  l 


=  a(u,u2 ) , 


with  convergence  in 


!:  (cf.  (4.10)  and  (4.11)).  We  assume 


c  H  is  a  family  of  finite  dimensional  spaces  satisfying  (8.13) 
with  .3(h)  =  a.  The  eigenvalue  problem  (8.14)  with  si  h  =  S2 
S^,  i.e.,  the  problem 


(9.4) 


uh e  V  %  '  0 

a  ( u.  ,  V )  =  Xhb(u,,v),  V  v  Sh 


has  a  finite  sequence  of  eigenvalues 


0  <  *  ,  .  <  A  n  . 

1 ,  h  2  ,  h 


and  corresponding  eigenvectors 


* N , h '  N  =  dim  Sh' 


U1  ,h . uN,h' 


which  can  be  chosen  to  satisfy 

( 9 • 5 )  a(u.  .  ,u.  .  )  =  A  ,b(u.  .  ,u  ,  )  =  A  .6  .  .  . 

i / h  j t h  j  j  n  i/h  j.h  i.hij 

The  A k  and  a^  ^  satisfy  the  extremal  principles  stated  in 
Section  8. 

Our  analysis  makes  use  of  the  following  lemma  that  expresses 
a  fundamental  property  of  eigenvalue  and  eigenvector  approximation 

Lemma  9.1.  Suppose  (a,u)  is  an  eigenpair  of  (9.1)  with  u 
=  1,  suppose  w  is  any  vector  in  H  with  w  ,  =  1,  and  let  a 


=  a ( w , w ) .  Then 


9.6) 


2  ,  2 

A  -  A  =  w-U  -  A  w-U  ,  . 

a  b 


(Note  that  we  have  assumed  u  and  w  are  normalized  with  respect 


Eli] 


to  1  •  '■  here,  whereas  m  (9.2)  and  (9.5)  we  assumed  u.  and 
b  l 

u.  ,  are  normalized  with  respect  to  !;  •  .) 

l ,  h  a 

Proof .  By  an  easy  calculation, 

I:  w— u !'  2  -  Ajlw-u!1?  =  !iwi!2  -  2a(w,u)  +  ,'u!:2 
a  b  a  a 

-  A!|w>!2  +  2Ab(w,u)  -  Mu!'2. 


Then,  since 


:!  w !! .  =  '!  u!i .  =  1, 

b  b 


'!wi|  =  A  , 

a 


!lul!‘  =  A  , 

a 


a(w,u)  =  Ab(w,u) , 


we  get  the  desired  result. 

For  i  =  1,2,...  suppose  A  is  an  eigenvalue  of  (9.1 

l 

with  multiplicity  q^  ,  i.e.,  suppose 


A,  ,  <  A,  =  A, 
k  .  -1  k .  k . +1 

i  li 


*  k .  +q  .  - 1  <  ^k.+q.  '  k  .  + 1  ' 

l  i  i  l  l 


Here  k^  =  1 ,  k^  is  the  lowest  index  of  the  2nd  distinct  eigen 
value,  k^  is  the  lowest  index  of  the  3rd  distinct  eigenvalue, 
etc.  Let 


(9.7) 


V1-*1 


inf  u-t 


j  =  1  .  • 


U‘  MM  ) 
l 


alu.u  )  =  ... 
1 


=  alU'Uk..j-2,h'  "  ° 
1  J 


where  M  M  ^  )  is  defined  in  (8.18).  The  restrictions  a(u,u^r 
i  i 


=  . . .  =  a(u,u,  ,  „  .  )  =0  are  considered  vacuous  if  1=1. 
k, + j-2 , h 


Note  that  ;-^(h,l)  =  where  >-  ^  { *  ^  )  is  defined  in  (8.21). 

We  now  estimate  the  eigenvalue  and  eigenvector  errors  for  the 
Galerkin  (Ritz)  method  (9.4)  in  terms  of  the  approximabi 1 i ty 
quantities  i^(i,j). 


Theorem  9.1.  There  are  constants  C  and  h^  such  that 


2  ,  .  . 


<9'8)  ‘k.+j-l.h'V+j-l  s  cch(1'j''  v  0  <  h  s  V  J  -  l--- 
1  J  1  J 

i  =  1,2, 


and  such  that  the  eigenvectors  u^Ug,...  of  (9.1)  can  be  chos 
so  that 


en 


«*•»>  uk . + j-l , h"uk . + j -1  a  s  C£h(i’3»'  V  0<h5h0'  1  -  1 . *i- 

1  J 

i  =  1,2,..., 


and  so  that  (9.2)  holds. 

Remark  9.1.  (9.8)  should  be  compared  with  (8.43),  (8.47),  and 

(8.51). 


Proof .  Overview  of  the  Proof .  The  complete  details  of  the  proof, 
which  proceeds  by  induction,  are  given  below.  Here  we  provide  ar. 
overview.  In  Step  A  we  give  the  proof  for  i  =  1.  The  proof  is 
very  simple  in  this  case  and  rests  entirely  on  the  minimum  prin¬ 
ciple  (8.39)  and  Lemma  9.1. 

The  central  part  of  the  proof  is  given  in  Step  B.  There  we 
prove  the  theorem  for  i  =  2,  proving  first  the  eigenvalue  esti¬ 
mate  (9.8)  and  then  the  eigenvector  estimate  (9.9).  In  particular 
in  Steps  B.l  and  B.2,  estimates  (9.8)  and  (9.9),  respectively,  are 
proved  for  j  =  1.  We  further  note  that  the  argument  used  in  Step 
B  proves  the  main  inductive  step  in  our  proof,  yielding  the  result 


for  i 


i  +  1  on  the  assumption  that  it  is  true  for  i 


be  somewhat  more  specific,  the  argument  in  Step  B .  1  proves  (9.8) 


directly  for  any  i  >  2  (and  j  =  1)  and  that  in  B.2  proves  (9.9) 
for  i  =  _i  +  1  (and  j  =  1)  under  the  assumption  that  h  - 

uJ!  - *0  as  h >0  for  t  <  k.  .-1  (cf.  (9.30). 

cl  111 

Details  of  the  Proof.  Throughout  the  proof  we  use  the  fact  that 
£^(i,j)  can  also  be  expressed  as 

£.(i,  j)  =  inf  inf  -iu-t  ! 

ueM(\k  )  t£Sh 

(9.7-)  1  »U  "«lc.  ,h>'-  •  •**'*  ' 

1  1  J 

a(U'Uk..h)  =  ---=a(U'Uk.<.j-2,h,I° 

1  1  J 


Step  A.  Here  we  prove  the  theorem  for  i  =  1. 

Step  A . 1 .  Suppose  l,  (k,  =1)  is  an  eigenvalue  of  (9.1)  with 

*1  1 


multiplicity  q1  ,  i.e.,  suppose  x i  =  ^ 2  = 


<  ^ 


In  this  step  we  estimate  *  ^  h  -  X  ^  ,  the  error  between  *  ,  and 

the  approximate  eigenvalue  among  *  . . x  ,  that  as  1  'sen' 

1  ,  n  4  2  •  n 


to  i.e.,  we  prove  (9.8)  for  i  =  j  =  1.  Note  that 


',(1,1)  =  inf  inf  u-t 
n  u-  M(  x  l  )  v  Sh  a 

is  the  error  in  the  approximation  by  elements  of  ,  I  t  h*-  t 

easily  approximated  eigenvector  associated  with  *  , . 

From  the  definitions  of  /  .(1,1)  we  see  that  there  ; 1 

h 

M(  x  )  and  an  s,  S,  such  that 
1  h  h 


(9.10)  u,  -  s.  ='(1.1). 

h  ha  h 


Let 
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TVJVW 


Uv 


u  = 


/biG^G^ 


'  sh  = 


/b,sh'sh: 


By  the  minimum  principle  (8.39)  we  have 


9.11) 


h.h  "  h  -  a(sh'sh'  ‘  h 


Now  apply  Lemma  9.1  with  (\,u)  =  (\  , u^ ) ,  w  =  s^,  and  \ 


a(s,  ,  s,  ).  We  obtain 
h  h 


alVV  ■  h 


2  ,  -  2 
s ,  -u.  -  t  ,  s,  -u.  . 

h  h  a  1  h  h  b 


(9.12 


2  2 
s,  -u.  C  s,  -u, 

h  h  a  h  h  a 


(9.10)  -  (9.12]  yield  the  desired  result. 


Step  A 


In  this  step  we  prove  [9.9)  for  i  =  j  =  1.  Let 
re  eigenvectors  of  ; 9 . i ;  satisfying  (9.2).  Write 


wo  have 


\ 


\ 


2.0 


(9.14) 


s  cu  -  v\«>"1/:Vi-i>- 


"  ( l ) 

Z  a u  . 

4  =  1  J  J 


i  =  1  J  J 

Redefining  u  to  be  — - - ,  we  easily  see  that  iu1 

'i  £  a 1  *  u  .  ii 

j=l  3  3  3 

1,  so  that  (9.2)  still  holds,  and  from  (9.14)  we  obtain 


(9.15) 


!ul,h  '  Va  s  ^h'1'1’1 


as  desired.  Note  that  u  may  depend  on  h. 

Step  A.3.  Suppose  q1  >  2 .  Prom  (9. 7')  we  see  that 


(9.16) 


«h(l,2)  = 


'!  u-  x  1 


ue  M  (  X  1 ) 
a(u'ui,h)=0 


aU'ui,h)=0 


Choose  u.  -  M  (  X  )  with  a(u,  ,u,  ,  )  =  0  and  s.  e  S,  with 
n  l  h  1 ,  h  h  h 

a^Sh,Ul  h^  =  0  so  t^at 


(9.17) 


!  Uh-Sh  '  a  *  V1'2)' 


and  let 


Uh  = 


'  Sh  = 


'b(W 


/b(sh'sh) 


Since  a(sh'ui  ^)  =  0,  from  the  minimum  principle  (8.39),  Lemma 


9.1,  and  (9.17),  we  have 


(9.18) 


*h,2  -  2  !  “sh  -  Va  '  Cfhll'2>' 


This  is  (9.8)  for  i  =  1  and  j  =  2 


Step  A. 4.  In  Step  A. 2  we  redefined  u^  .  Now  redefine  u^ 

so  that  u  , ...,u  are  a-orthogonal .  Write 
1  ql 


uh,2  “Za32)ur 

j  =  l 


Now,  proceeding  as  in  Step  A. 2  and  using  (9.18),  we  have 

CD  CD 

11  -  v\+i>  X  R21)  s  'IH2,j u  -  v* 

j=q1+i  j=i 


v’.  i  R21)  s  'IR2)) u 


-  *  n/\  • 
2  3 


j=q1+i 


=  !a(U2,h'U2,h)-'2b(U2,h 
(*2,h  "  *2)X2,h 


<  Cc‘(l,2)  . 


Thus 

(9.19) 

But  by  (9.15), 


U2.h  -  Za12,ul"a  s  C£h(1'2> 


■i2’  * 


(9.20) 


a(U2,h 

,u:  ) 

a(U2,h 

t'V 

U2,h 

:  u 
a 

C£h(1' 

1) 

C^h'1' 

2)  . 

Combining  (9.19)  and  (9.20)  we  get 


qi  qi 


2u,  h  -  Va<2>u.  s  !'  u„  h  -  Ya<2V.  +  aj2>u, 

2 ,  h  Z-J  ja  2 ,  h  Z-J  ja  1  1< 

j-2  j=l 


C^h(  1  ,  2) 


Redefining  u2  to  be 


we  see  that  ju 


2  a 


=  1  an 


1  a52’Va 

j=2  3  3 


a(u^,u2)  =  0,  so  that  (9.2)  holds  and 


(9.21) 


,U2,h  "  Va  5  Cf'h(1'2>' 


which  is  (9.9)  for  i  =  1,  j  =  2. 


Step  A. 5.  Continuing  in  the  above  manner  we  obtain  the  proof  of 


(9.8)  and  (9.9)  for  i  =  1  and  j  =  l,...fq  . 


Step  B.  Here  we  prove  Theorem  9.1.  for  i  =  2. 


Step  B . 1 .  Suppose  (k2  =  q1+l )  is  an  eigenvalue  of  (9.1)  of 


multiplicity  q2 .  In  this  step  we  estimate  *  ^  h  ~  '  t^ie 


error  between  k  and  the  approximate  eigenvalue  among 

K2 


X.  .  ,  .  that  is  closest  to  1,  .  Note  that 

k2,h  k2+q2-l,h  k2 


(9.22) 


r. . (2,1)  =  inf  inf  hu-t!  . 

fl  W  t  \  gy  SI 

u‘5M(,k2l  ,t;sh 


Introduce  next  the  operators  T,T^  :  H — >H  defined  in  (8.8 


and  (8.16),  respectively,  i.e.,  the  operators  defined  by 


Tf  e  H 

a ( Tf , v)  =  b( f ,v) ,  V  v  e  H 


and 


'V  *  sh 


a ( Thf , v)  =  b(f,v),  V  v  €  sh 


It  follows  from  (8.1),  (8.7),  (8.32),  and  (8.33)  that  T  and  T 


are  defined  and  compact  on  H.  Furthermore 


(9.23) 


I!  ( T-T.  )  f  SI  <  C  inf  ijTff  -  *  ||  . 

rsSh 


We  now  suppose  the  space  H  and  the  bilinear  forms  a  and 

b  have  been  complexified  in  the  usual  manner.  Let  r  be  a 

circle  in  the  complex  plane  centered  at  /j .  =  enclosing  no 

2  *2 

other  eigenvalues  of  T.  Then  for  h  sufficiently  small,  f 
r  c  P(Th)  and  int  (D  n  cr ( Th )  =  (u^ . “kj+qj-l.h1'  where 

X. ^  +i  =  +  Also,  as  we  have  seen  in  Section  6,  E(^  )»  the 

spectral  projections  associated  with  T  and  ,  and  Eh(/-<k  )  , 

the  spectral  projection  associated  with  T.  and  /.<  .  .  ,  i  = 

n  ^2^1 / n 


,q2~l,  respectively,  can  be  written  as 


(9.24) 


(9.25) 


E(^k2)  "  2¥T 


Eh(^k2)  2tri 


( z-T ) -1dz 


(z-Th)  dz 


Let  u  e  R(E(/Jk  )).  Then  vh  =  Eh ( )  u  e  H(E {//^  )),  and 
from  the  formulas  (9.24)  and  (9.25)  we  obtain 

I!U  -  vh.ia  .  -  Eh(Pk2l)uia 

r 

=  II  2^J  (z-Th)_1(T-Th)  (z-T)_1u  dzl! 


(9.26) 


■  si*t  <z-V1(T-Vf£-  d2i 

Jr  k2 

5  h  l2v  rad(r)]  sup  !!  (z-Th)_1.,  ^5^- 


i‘  (  T-T,  )  u,: 

n  a 


(/'k2+q2-l.h  -  %+rad(r))  ;;(T-Th)u:a 


<  C!i  (T-Th)uil  . 


(9.23)  and  (9.26)  yield 


■  u-v  '  <  C  inf  Tu-t  : 

3  'eSh 


(9.27) 


=  C  inf  :/j.  u-t  ! 


<  C  inf  ]:u-t!'  . 

•^Sh 


This  is  an  eigenvector  estimate;  it  shows  that  starting  from  any 


u  e  R ( E ( // .  ))  we  can  construct  a  v.  =  v.  (u)  e  R(E.  (a*.  ))  that 

k2  h  h  h  k2 


is  close  to  u.  We  now  use  (9.27)  to  prove  the  desired  eigenvalue 
estimate . 

By  the  minimum  principle  (8.39)  we  have 


(9.28) 


A  -  A 

k2.h  k2 


inf 

veS 


a( v, v)  -  x 


h 
=  1 


a ( v ,u .  .  )=0, 
i ,  h 


i=l , . . . ,k2~l 


Since  v.  =  R(E,  (//.  )),  we  know  that  a(v,  ,u.  ,  )  =  0,  i  = 

n  n  k2  n  l ,  n 


l,...,k2-l.  Thus,  from  (9.28)  we  find 


x ,  .  -  x .  <  a 

k2,h  k2 


'•  v  11  '  11  v 

h  b  h 


-  x 


V 


Combining  this  with  Lemma  9.1  and  (9.27)  we  obtain 


<  C  inf  u-j 


for  u 


(9.29) 


R(E(/v.  ))  with  u  =  1.  Hence,  using  (9.22), 

Kj  a 


Vh  k2 


C  inf  inf  »u-» 
u,M(^2)  ^-Sh 


=  Cf  ‘  (  2 , 1  )  , 


which  is  (9.8)  for  i  =  2,  j  =  1. 

Comment  on  Inequality  (9.29).  A  careful  examination  of  the  proof 

of  (9.29)  shows  that  C  depends  only  on  /n.  -  and  /*. 

K1  k2  2 

/j,  ,  but  is  independent  of  h,  and  that  (9.29)  is  valid  for  h 

K3 

<  hQ ,  where  hQ  is  such  that  h  <  hQ  implies  r  -  pfT^), 

int  tn  n  a(Th)  =  (u^  h . and  Uk2‘uk2+q2-l  ls 

small,  say  u.  -u.  ,  ,  ,  :  rad  (T)/2.  Note  that  if  we  were  con- 

k2  k2+q2-l,h 

sidering  a  family  of  problems  depending  on  a  parameter  r,  we 


could  bound  C  =  C(t  )  above,  independent  of  r,  provided  //.  (<  ) 

kl 

-  it.  (t  )  and  //.  .-/J ,  (  r  )  were  bounded  away  from  0,  and  we 

*2  k2(r)  k3 

could  bound  hQ(r)  awaY  from  0  if  T(i)  -  /’(T^(‘)!.  Int  (Mr)) 

•,|Th")l  =  <uk2,h . uk2*q2-l,hK  lnd  "k2|r|-'k2tq2-l,h1'1 

rad  ( T ( r  )  )  / 2  ,  uniformly  in  r. 

Step  B.2.  Suppose,  as  in  Step  B.l,  that  M  has  multiplicity 

- - 

q2 .  We  have  shown  in  Step  A. 5  that  we  can  choose  the  eigenvectors 
u^.u^,...  of  (9.1)  so  that  (9.2)  holds  and  so  that 


(9.30) 


C-*  (  1  ,  i  )  ,  j  =  1 . q ,  =  k  -  1  . 


ft] 


Write 


no 


(9.31) 


V  k2 
Uk  ,h  ZPj  Uj' 
3=1 


From  (9.31)  we  have 


'XJ 

I[* 


(  k2  )t  2 


j=l 


(1  -  \2/^)i  =  la(uk2>h,uk2>h)  -  lk2b(uk2(h,uk2(h) 


-1 


=  Uk2,h  Xk2Uk2,h' 


which,  together  with  (9.29),  yields 


V1 


(9.32)  | 


(k„H2 


00 


I  p  ] 


j=l 


d  -  ik  a.)  + 

K2  3 


I  [■ 


<*2h2 


(1  - 


j=k2+q2 


X  ,  /  V  . 

k„  i 

2  J 


*  Cc‘(2,l)  . 


Note  that  the  first  term  inside  the  absolute  value  is  negative  and 
the  second  is  positive.  In  addition 


C1  s  11  -  ‘k/'j1  5  C2 '  Vj  *  *V  V1 . W1’ 


with  Ci><32  positive  numbers.  Hence  from  (9.32)  we  obtain 


V1 


.33)  Y.  P’]2  5  °l'h'2.1>  *  D2  £ 


(9 


r  (k2)n2 
a 


3  =  1 


3=k2+q2 


and 


(9.34) 


I 


r  (k2)‘i 2 


k2-! 


Vh'2’11  + 


j=k2+q2 


».X  [■ 


(k2H2 


3  =  1 


Write 


(9.45) 


\r  —  1 

2  .1/2 


I-S 

j=l 


<  C p.  (h) . 
K2 


Using  (9.29),  (9.44),  and  (9.45),  and  the  fact  that  p,  (h) - >0 

*2 


as  h - >0,  we  get 


Xk„ , h  *k 

Cc^(2,l)  >  — - - 

h  xk2.h 


(9.46) 


2  a(Uk2,h'  I!  >  Xk2b(uk2,h'  i:  v>  .”>a  > 


(k 2)r  (k?) 

^  K 


=k2+q2 


k2"1  -] 
Z  aibi,J(1  -  r7>  ' 

i  =  l 


where  C'  >  0  and  is  independent  of  h.  Combining  (9.36),  (9.39 

(9.40),  and  (9.46)  we  obtain 

V"  f  (k2h2  2 

I  b  J  * 

f=k2+q2 


oo  k2  1 

Y"  ^2*  Y 

+  X  la€  1  >  ' 


^=k2+q2  i=l 


k  -1 
2 


X|ai|ibi.<'] 


C  f “(2,1)  + 


I  I 

i=l  f=k2+q2 


(k  ) 

af  Nblf,l 


TL*  r..M  *.,a  \ji  f .Jl  AjJaVajI  ^  ji  r.Ji  \Jk  ^  '*M  T.M7, Ui? 


(9.47)  is  a  quadratic  inequality  in 

solution  yields 


'XJ 

Z 


r  (k?h 

2' 

b  ] 

1. 


(9.48) 


I  b 


(k2) 


k2-i 


f =k„+q„ 
2  ^2 


o  h( 1 ,k2-i ) 


z  •; 


i  =  l 


(kJi  2 


+  C<'  -  h  (  2  ,  1  ) 


Combining  (9.33)  and  (9.48)  we  get 


k2- 1 


Zf  (k2)1  2  2  V  f  l*2;1 

[a  i  J  ‘  Vb2'1’  +  V'h^'V^Z  |ai  J 

i=l  i=l 


V1 


( k„  ) 


neb 

2  h 


and  thus,  since  (l.kg-l)  is  small  for  h  small. 


k2-l 


(9.49 


z 


-  D5fh(2'1) 


i  =  l 


Next,  combining  (9.34)  and  (9.49),  we  get 


(9.50) 


Z 


(  k  ) 


+  D6t  h(2'1)  • 


t=k_+q_ 
2  ^2 


Finally,  from  (9.31),  (9.49),  and  (9.50)  we  have 


V'V1 


u 


k2  ,  h 


Z 


j=k. 


( k  2  ) 

a  .  u  . 

J  J  a 


V1 

(k2K2 


Z  lb 

L  j=l 


z 


laj 


(  k  2  )  1 


j=k2+q2 


Ca  ,  (  2  ,  1  )  . 


Redefining  u  to  be 
k  2 


k  +q  -1 

2  _2  {k2} 
i.  a  .  u  . 

j=k2  ^  J 

k2+q2_1  (k  ) 

V.  a  .  u  . 

i=k  3  3  a 

3  k2 


we  see  that  u. 


whose 


(2.1), 


1  2 
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so  that  (9.2)  holds,  and 


(9.51) 


|uk2,h  '  uk. 


*  Cc  h  (  2  , 1 ) 


This  is  (9.9)  for  i  =  2,  j  =  1. 


Comment  on  Estimate  (9.51).  In  the  proof  of  (9.51)  we  used  (9.30) 

which  was  proved  in  Step  A.  A  careful  examination  of  the  proof  of 

(9.51)  shows  that  we  that  we  did  not  use  the  full  strength  of 

(9.30),  but  only  the  weaker  fact  that  !|u.  ,  — u  . H  - * 0  as  h - >0 

J  ,  h  j  a 

for  j  <  k2~l.  (Cf.  the  Overview  of  the  Proof.) 

Step  B.3.  Suppose  q„  *  2 .  In  Step  B.l  we  estimated  A ,  L  - 

^  <2  *  ^ 

X.  .  In  this  step  we  estimate  X.  -  X. 

k2  *  k2+l,h  k2+l 

We  proceed  by  modifying  problems  (9.1)  and  (9.4)  by  restrict¬ 
ing  them  to  the  spaces 


k2.h 

H  -  (u  e  H  :  a(u,Uk  h)  =  0} 


and 

k2,h 

Sh  =  <u  e  Sh  :  a(u'uk2,h>  =  °>' 


k2  ,h 

respectively,  i.e.,  we  consider  the  problems  (9.1  )  and 

k2,h  k„,h 

(9.4  )  obtained  by  replacing  H  and  S,  by  H  and 

*2,h  n  k  h 

Sh  in  (9.1)  and  (9.4),  respectively.  (9.4  )  has  the  same 

eigenpairs  uj,h^  as  does  (9.4),  except  that  the  pair 

k2 » h 

»u  )  has  been  eliminated.  (9.1  )  has  eigenpairs 

*2  9  **  ^2  9  ** 
k2  ,  h  k2  ,  h 

(Xj  '  Uj  ^ '  which  in  general  depend  on  h.  Nevertheless, 

*2  /h 

(9.52)  A  =  Au  ,,  t.  =  0 . q  -2, 


i.e.,  A  ,  the  eigenvalue  under  consideration,  is  an  eigenvalue 
K2 


k2  ,  h 

of  multiplicity  q2~l  f°r  Pro^^em  (9.1  ) 


Its  eigenspace  is 


k  ,h 

M  (\k  )  =  (u  e  M(\k  )  :  a(u,uh  k  )  =  0} 


We  can  now  apply  the  argument  used  in  Step  B.l  to  problems 


k  ,h  k  ,h 

(9.1  )  and  (9.4  )  and,  using  (9.7'),  we  obtain  (cf.  (9.29)) 


(9.53) 


Xk2  +  l,h  "  *k2  +  l  5  CC2,2(h)'  for  h  <  h0  * 


k2  ,  h 

Since  ufc  depends  on  h,  the  problems  (9.1  )  and 

k2,h  2< 

(9.4  )  depend  on  h.  It  follows  from  the  Comment  on  Inequality 


(9.29)  with  f  =  h  that  we  can  apply  the  argument  in  Step  B.l  ob¬ 


taining  C  and  hQ  that  are  independent  of  h.  To  see  this,  note 


that  h.{t)sh  by  (9.52),  u.  (r)  s  /i  by  the  minimum 

2  2  3  3 


principle,  and  {  r  )— + //fe  ,  since  //fc  -//k  (  r  )  -  -//k  h<  by 


the  minimum-maximum  principle,  and  uk  ^ - »uk  (cf.  (9.51)),  and 


hence  that  //,  (r)  -  /;  (r)  and  i-t.  (t)  -  //  (r)  are  bounded  away 

1  *2  K2  j 


from  0.  Then  note  that  I(r)  =  r  :  />(T,  ( r  )  )  =  />(T.  )-(u,  .  ), 

h  h  k2 , h 


Int  ( T  (  r  )  )  ‘ T  (  T h (  r  )  )  =  Int  (V  )  -  (<t  (  Th)  -  {uk  )  )  = 


(u.  . . u.  .  },  and  u  ( r )  -u  (r)  = 


uk2-uk2+q2-l.h  rad  <r  <  r  >  > /2  ’  rad  <rl/J 


Step  B . 4 .  Suppose  2  as  in  Step  B.3.  Here  we  show  that 


u.  ,  ,  can  be  chosen  so  that  u,  ,  .  -  u.  .  <  C>  .  {2  ,2)  .  We 

k2+l  k  +l,h  K2+1  a  h 


know  that 


(9.54) 


u  .  ,  -  u  . ; 

J  .h  j  a 


c«hU.  J)  .  j  =  1 . q2 

C£,  (2  ,  1  )  ,  j  =  q  +1  =  k. 


(cf.  (9.15),  (9.21),  and  (9.51)).  Assume  that  u 

have  been  redefined  so  that  (9.2)  holds.  Write 


V1'" 


"k2+(V] 


Uk  +l,h  X 
5=1 


(k2+i) 

a  .  u  . . 

3  3 


If  we  apply  the  argument  used  in  Step  B.2  to  u.  ,  i.e.,  if 

k  2  l ,  h 

we  let  k2  be  replaced  by  k2+l  and  use  (9.53)  instead  of  (9.29 


we  obtain 


k2+q2-1 


V  (k25 

Uk  2  +  l, h  Z-  aj  U5 

5=*o 


a  £  Cch(2'2) 


But,  by  (9.54)  , 


(k2+l) 

ak2  1  "  |a(V,+l,h'uk2>' 

=  ,a(uk2+l,h'uk2  -  uk2,h)! 
"  :Uk2  "  uk2,h  a 


=*  Cfh(2,l) 


*  Cc  h  (  2 , 2  ) 


and  hence 


k2+q2_1 


k0+l , h 


V  (k2) 

A-  a  5  u  i 


o  (h) 


k2+q2+1  (k  ) 

Z  ai  Uj 

j=k  +1  J  2 

Redefining  u.  to  be  ■  — — - ,  we  see  that 

k2  +  1  Vq2_1  (lO 

II  2  a.  u .  |! 

j=k2+l  2  2 

!!u  XJL  =  a<u>  .  i  » )  =  0,  j  =  1 ,  .  .  .  ,  k  ,  so  that  (9.2) 
k2+l  a  k2  +  l  2  2 


holds,  and 


'!uk2  +  l,h  '  Uk2  +  l"a  5  C*2,2(h)  ' 


which  is  (9.9)  for  i  =  j  =  2. 

Step  B.5.  Continuing  in  this  manner  we  prove  (9.S)  and  (9.9) 
for  i  =  2  and  j  =  l,...,q2. 

Step  C.  Repeating  the  argument  in  B  we  get  (9.8)  and  (9.9) 
for  i  =  3,4,...  .  This  completes  the  proof. 

Remark  9.2.  Babuska  and  Aziz  [1973],  Fix  [1973],  and  Kolata 
[1978]  proved  the  estimate 


(9.55) 


A.,  ,  .  -  A.  .  .  ,  <  C  £ .  ( A  .  )  ,  j  =  l,...,q. 

ki+j-i.h  ki+j-l  h  l  J  Mi 


where  £h(A^)  is  definec*  in  (8.21).  (9.55)  is  weaker  than  (9.8). 

For  j  =  l,...,q^-l,  (9.8)  shows  higher  rates  of  convergence  for 

certain  problems;  see  the  discussion  of  multiple  eigenvalues  in 
Subsection  10B.  Birkhoff,  De  Boor,  Swartz,  and  Wendroff  [1966] 
estimated  +j-l  h  “  ^k.+j-l  in  terms  of  the  sum  of  the  sc3uares 

of  the  a-norm  distances  between  S^  and  the  unit  eigenvectors 
associated  with  all  the  eigenvalues  \e  not  exceeding  \.  . 


(10.1) 


“(I  +  ~  ^Au  =  <*>pu 

dO 

-  ( *■  +  ~  ^Av  =  <Jpv'  ( x » Y )  e  0, 


where  0  =  ^  We  obtain  (10.1)  from  (1.35)  by  assuming  that 


dx  dy  ‘ 

u(x,y,z)  and  v(x,y,z)  are  independent  of  z  and  that  w(x,y,z) 
=  0.  The  boundary  conditions  describing  the  traction  free  bounda¬ 
ry  are 
(10.2) 
where 

dv 


Xn  =  Yn  =  0,  (x,y)  e  T  =  <30, 


fx  =  xen 

3u  ,3u 

+  fjt-% —  +  ^  — 

'on  ^'dx 

n  x 

(10.3) 

< 

Y  =  XOn 
{  n  y 

dv  ,<3u 

+  +  *{5y 

(10.2),  with 

X 

n 

and  Y 

n 

given  in  (10 

dv 


,) 


tions  discussed  in  connection  with  the  elastic  solid  specialized 
to  the  L-shaped  panel. 

We  now  consider  the  specific  case  in  which 


v  = 


2  (  \  +/i ) 


-  .3,  e  .  aJJiinn  =  i 


X 


(i.e.,  in  which  X  =  15/26  and  y  =  5/13).  v  is  called 
Poisson  1 s  ratio  and  E  is  called  Young  1 s  modulus  of  elasticity . 

G  =  fj  is  called  the  modulus  of  rigidity .  Note  that  0  s  i  <  1/2 
for  any  material . 

We  now  discuss  the  basic  steps  in  the  finite  element  approxi¬ 
mation  of  the  eigenvalues  and  eigenfunctions  of  the  problem  (10.1) 
(10.2),  or,  more  generally,  of  any  eigenvalue  problem.  These 
steps  are  as  follows: 

1)  Derivation  of  a  variational  formulation  (8.10)  for 


(10.1),  (10.2)  and  verification  of  conditions  (8.1) 


w\ 

V. 


i 

f. 

y. 

P! 


r. 


'.V 


Cv 


v> 

L 


S 


* 


(8.3),  and  (8.6)  ((8.1), (8. 7), (8. 32),  and  (8.33)  in  the 

selfadjoint,  positive  definite  case). 

2)  Discretization  of  (8.10)  and  assessment  of  the  accuracy 

of  the  approximate  eigenvalues  and  eigenfunctions.  The 
discretization  proceeds  by  the  selection  of  the  trial 
space  Sj  ^  and  test  space  S2  verification  of 

(8.11)  -  (8.13),  consideration  of  the  finite  dimensional 
eigenvalue  problem  (8.14),  and  explicit  construction  of 
the  matrix  eigenvalue  problem  (8.15).  The  accuracy  of 
the  approximation  is  assessed  by  means  of  the  application 
of  the  results  of  Section  8. 

3)  Solution  of  the  matrix  eigenvalue  problem  (8.15). 

The  accuracy  of  the  approximation  method  (8.14)  depends  in  a 
crucial  way  on  the  trial  and  test  spaces  S1  h  and  S2  h,  and 
their  rational  selection  is  strongly  influenced  by  the  available 
information  on  the  eigenfunctions,  typically  information  regarding 
their  regularity.  Thus,  also  of  importance  is 

1')  Analysis  of  the  regularity  of  the  eigenfunctions. 

Remark  10.1.  The  approximation  methods  we  will  discuss  in  this 
section  are  referred  to  as  Ritz  methods,  by  which  we  mean  that  the 
eigenvalue  problems  under  consideration  are  selfadjoint  and  posi¬ 
tive  definite  and  that  the  test  and  trial  space  are  equal  (si  h  = 
S2  k  =  S^) ;  see  the  discussion  in  Section  8. 

1 )  Variational  Formulation 

We  begin  by  casting  our  problem  in  the  variational  form 


(  10.4) 


u  c  H 

a(u,v)  =  wb(w,v) ,  V  v  f  H, 


113 


* 

3 


where  H  is  an  appropriately  chosen  Hilbert  space  and  a  and  b 
are  appropriately  selected  bilinear  forms.  This  process  was  ex¬ 
plained  in  Section  3.  We  typically  proceed  as  follows.  Multiply¬ 
ing  the  first  equation  in  (10.1)  by  0,  the  second  by  0,  adding 
the  resulting  equations  together,  and  integrating  over  Q,  we 
obtain 


{[-(X  +  ~  /(A  u  ]0  +  £-(X  +  p)|^  "  £*A  v  ]  0  }dxdy 


(10.5) 


=  w  p ( u0  +  ) dxdy . 

■*0 


Now,  integration  by  parts  shows  that 


<[-(*  ♦  /*>§! 


-  //A  u  ]0  +  [-(X  +  p)^  -  pA  v]  0  )dxdy 
60 

=  (X  +  p)0^dxdy  -  (X  +  p)©0 nxc 

•  r\  Jr 


+  pVu*V0dxdy  -  p^0ds 


+  (X  +  ij)0-~ dxdy  -  (X  +  p)00 n^ds 


(10.6) 


I  ^  V 

+  p7v*V0dxdy  -  p^vds 

0  ■‘r 

<<» -  *«>(5s  ♦$(&♦§$) 


<?v~)  ("60  dn~)  d\x  chi)  6v  60 

+  [ay  +  a^j  [ay  +  axj  '  2ax  '  2ay  ^x1  )dxdy 


(X  0  +  Y  0 ) ds 
n  nr ' 


Combining  (10.5)  and  (10.6)  we  see  that  if  (<.>,(u,v))  satisfies 


(10.1)  and  (10.2),  then 


0 


(10.7) 


.  rau 

dv" 

1  (d<t> 

,  dtp') 

^  [ax 

+  ^yj 

1  [tt 

+  ^yj 

+  n[ 

'3u  , 

avl 

30 

ay  + 

3xJ 

avl 

3xJ 


au  ajp 

ox  3y 


.av  34.,  ,  , 

aS>|dxdY 


=  w  p ( u0  +  vy ) dxdy 
Q 

for  all  smooth  (<p  ,ip)  ,  and,  conversely,  If  (10.7)  holds  for  all 

smooth  (<p  ,y)  ,  then  (10.1)  and  (10.2)  hold,  provided  u  and  v 

2 

are  smooth  (u,v  e  H  (Q ) )  . 

From  (10.7)  we  see  how  to  choose  H,  a,  and  b  in  (10.4). 


Let 


(10.8) 


H  =  H1(Q)xH1(0) 

"U.VIII2  =  ♦  llvl,2_0 


and  on  H  define  the  bilinear  form 


(  10.9) 


a  ( u ,  v ;  0  ,  ip  )  =  {  (A  + 

^0 


It  is  immediate  that  (8.1)  is  satisfied  and  that  a  is  symmetric. 
Let  us  remark  that  a(u,v,-u,v)  has  the  physical  meaning  of  the 
(double)  strain  energy  and  that  /a ( u , v ; u , v )  is  referred  to  as 
the  energy  norm  of  (u,v).  Recall  from  Section  8  that  b  is  to  be 
defined  on  a  space  W  o  H.  Let 


(10.10) 


Iw  =  L2(0)<L2(G) 

l  no 


o 


m 


and  define 


(10.11) 


b ( u , v ;  0 , ip )  =  p ( ub  +  vv>) dxdy . 


It  is  immediate  that  b  is  symmetric  and  satisfies  (8.7)  and 
(8.33)  and  that  H  c  w,  compactly.  It  remains  to  consider  (8.32) 
Note  that  since  a  and  b  are  symmetric,  H1(0)  and  L2(Q)  may 
be  taken  to  be  real . 

We  begin  by  expressing  a(u,v;  <t>  ,V>)  in  terms  of  the  Poisson 
ratio  v  and  the  modulus  of  rigidity  G: 


a(u,v;  0,0)  = 


(10.12) 


_2G_  {(1_v)fiH  d±  +  df\ 

l-2v  u  1  v'  (dx  dx  dy  dyj 
J0 

fdu  dip  dv  d0j 

V  [dx  dy  dy  dxj 

l-2t>fdu  .  dvj  fd0  dip']  .  .  , 

+  __  [^  +  ^J  dxdy . 


From  (10.12)  we  have 


a  ( u ,  v ;  u ,  v )  = 


{ ( 1-^  )  [ 


(10.13) 


2G  /  /  i  ,  w  f5u1  ^  fdv'l  . 

1-2^  ]  [  [dxj  [dyj  ] 

Q 

,  „  du  dv  ,  1-2^  fdu  dvj  2  .  ... 

V  dx  dy  2  [dx  dxj  X  ^ 

2G  f  .  .  ,  _  ,  r  fdu)  2  ,  fdvj  2 
{(1_2y)(  [dxj  +  [dyj  1 
1 Q 

,  l-2t^fdu  ,  dvj2.  .  ... 

+  —  [5y  +  d^J  }dxdy- 


Recalling  that  v  <  1/2,  we  see  from  (10.13)  that 

a ( u , v ;  u,v)  >  0,  V  u,v 

(as  was  to  be  expected  from  the  physical  interpretation),  and  that 
a(u,v;  u.v)  =0  if  and  only  if 


i  Vi  M  '.I  M 


(10.14) 


u  =  u  =  c,  +  c  y 

l,c2*  3 

v  =  v„  „  „  =  c_  -  cx 

C1'C2,C3  3  2 


for  some  c  ,c  ,c  .  These  displacements,  which  are  characterized 

1  fc  w 

as  having  no  strain  energy,  are  the  "rigid  body  motions,"  i.e., 
translations  and  rotations.  Thus  (8.32)  does  not  hold  with  H 
defined  by  (10.8),  but  the  above  considerations  suggest  that  it 
might  hold  if  H  is  replaced  by  a  smaller  space  that  did  not 
include  the  rigid  body  motions.  In  fact,  if  we  define 


(10.15)  H  =  { ( u , V )  e  H  :  p(uu 


C1 ,C2'C3 


+  vv 


C1'C2'C3 


)  dxdy  =  0 , 


V  Cl,c2,c3}, 


then  it  can  be  shown  (see  Necas  and  Hlavacek  [1970]  and  Knops  and 
Payne  [1971])  that 

(10.16)  a(u,v;  u ,  v )  >  a  I!  ( u ,  v)  li  2  =  a(||u|!2  Q  +  l!vi|2  ft), 


V  ( u , v )  e  H, 

where  a  is  a  positive  constant.  This  is  (8.32). 

We  thus  restrict  a(u,v;  <t>  ,ip)  to  H  and  b(u,v;  «,»/’)  to 


(10.17)  W  =  { ( u , v )  e  W  :  P ( uu  +  vv  )  dxdy  =  0 

JQ  C1'C2'C3  C1'C2'C3 


V  Cl,c2,c3). 


For  the  eigenvalue  problem  (10.1),  (10.2)  we  therefore  have  the 
variational  formulation 

fO  0  (  U ,  V)  e  H 

(10.18)  ] 

[a  (  u ,  v ;  0  ,  V  )  =  (i)b  ( u ,  v ;  0,v),  V  (0,0)  C  H  . 


>.S 

....  '.V 

•  -V 


Thus,  with  a,  b,  H,  H,  <•> ,  and  <•>  chosen  as  in  (10.8)  -  (10.11), 
(10.15)  -  (10.17),  we  see  that  a  and  b  are  symmetric  and  con¬ 
ditions  (8.1),  (8.7),  (8.32),  and  (8.33)  are  satisfied.  (10.18) 

is  a  selfadjoint,  positive  definite  problem  of  the  type  studied  at 
the  end  of  Section  8. 

As  stated  in  Section  8,  (10.18)  has  a  countable  sequence  of 

eigenvalues 

0<<y1s<1>2<.  .  .  .  S'  +ao 
and  corresponding  eigenfunctions 

(ux , v1 ) ,  (u2,v2) , . . . , 

which  can  be  chosen  to  satisfy 

a,W  W  “VW  W  *  "j6ir 

When  implementing  our  approximation  method  it  is  simpler  to 
consider  the  eigenvalue  problem  on  the  space  H  instead  of  on  H, 
i.e.,  to  consider  (10.18)  with  H  replaced  by  H.  Then  <.>Q  =  0 
will  be  a  triple  eigenvalue  with  eigenfunctions  (u,v)  given  by 
(1,0),  (0,1),  and  (-y,x).  These  eigenpairs  and  their  approxima¬ 
tions  are  then  ignored.  If  the  rigid  body  motions  are  included 
in  the  space  S1  ^  and  S2  then  =  0  is  also  a  triple 

approximate  eigenvalue  with  the  rigid  body  motions  again  the  cor¬ 
responding  approximate  eigenfunctions.  If  this  is  not  the  case, 
then  dealing  with  H  and  H  does  not  lead  to  the  same  approxi¬ 
mate  eigenvalues  and  eigenfunctions.  It  is  easy  to  analyze  the 

case  in  which  the  rigid  body  motions  are  not  in  S,  .  and  S„  ,  , 

1 ,  h  2  ,  h 

but  we  will  not  do  so.  Alternately,  the  validity  of  (10.16)  or 
(8.32)  can  be  ensured  by  considering 


a(u,v;  <t>  ,  ip)  =  a  ( u ,  v ;  <p  ,  V )  +  b  (  U  ,  V  ;  <?>,») 


instead  of  a(u,v;  <t>  ,ip)  .  Then  the  triple  eigenvalue  1  would  be 


the  lowest  eigenvalue.  Usually  the  first  alternative  is  used. 


1')  Regularity  of  the  Eigenfunctions . 


We  have  seen  in  Section  8  that  the  accuracy  of  the  approximate 


eigenvalues  and  eigenfunctions  depends  on  the  degree  to  which  the 


exact  eigenfunctions  and  adjoint  eigenfunctions  can  be  approxmated 


by  elements  in  the  spaces  S1  h  and  S2  respectively  (see 


(8.23)  -  (8.26)).  In  the  selfadjoint,  positive  definite  case  this 


reduces  to  the  degree  to  which  the  eigenfunctions  can  be  approxi¬ 


mated  by  (see  (8.44)  -  (8.46)).  Since  the  approximabi 1 i ty  of 


the  eigenfunctions  depends  on  their  regularity,  it  is  essential  to 


determine  the  basic  regularity  properties  of  the  eigenfunctions. 


The  eigenfunctions  of  (10.18)  have  the  following 


properties . 


u,  and  v.  are  analytic  in  Q  -  UA . ,  where  A.  are 
i  i  j  l 


the  vertices  of  Q.  This  follows  from  the  general  theory 


of  elliptic  equations  (see  Morrey  [1966,  Section  6.6] 


The  functions  u^  and  v^  are  singular  at  the  vertices 


of  Aj ,  and  the  character  of  the  singularity  is  known. 


The  strength  of  the  singularity  at  Aj  depends  on  the 


interior  angle  at  A^ .  For  the  domain  we  are  consider¬ 


ing,  the  strongest  singularity  is  at  the  vertex  AQ .  The 


leading  terms  of  u^  and  v^  at  a  vertex  have  the  form 


(10.19) 


u*  =  c  j  ra  F i ( 0 ) 


v  =  C2r  F2(0), 


v  v  r 


where  (r ,0)  are  the  polar  coordinates  with  origin  at  the 
vertex,  a  depends  on  the  interior  angle  and  on  \  and 


H,  and  F1(0)  and  F^{0)  are  analytic  functions  of  0  . 
The  value  of  o  is  characterized  as  the  root  of  a  nonlin¬ 
ear  equation  and,  in  general,  can  be  real  or  complex.  For 
our  example  of  the  L-shaped  domain,  a  =  =  .544481-  • 

for  the  vertex  AQ .  For  a  more  complete  discussion  of  the 
singularities  of  solutions  of  elliptic  equations  in  poly¬ 
gonal  domains  we  refer  to  Kondratev  [1968],  Merigot 

[1974],  and  Grisvard  [1985].  Using  their  results,  any 

11  2  2 

eigenfunction  can  be  written  as  (u,v)  =  (u  ,v  )+(u  ,v  ), 

2  2  k 

where  u  ,v  e  H  (0 ) ,  where  k  is  an  integer  which  is 
greater  than  or  equal  to  3,  and  (u^v1)  is  a  linear 
combination  of  functions  of  the  type  on  the  right  side  of 
(10.19)  with  cr  >  aQ  and  with  coordinates  centered  in 
the  vertices  of  0.  Application  of  the  method  used  in 

* 

the  proof  of  Theorem  2.1  shows  that,  for  our  domain,  u 
* 

and  v  and  thus  u.  and  v.  are  contained  in 
•«0+1 

H  (0),  with  =  .544481- ■ • .  This  statement  of  the 

regularity  or  smoothness  of  the  eigenfunctions  is  the 
strongest  that  can  be  made  in  terms  of  Sobolev  spaces 
(without  weights). 

It  is  also  possible  to  show  that  u,v  e  ft ‘ (0  )  ,  for  any 
8  >  o  (8  =  o  +  k  )  ;  see  Theorem  4.4.  Of  course,  the 

2  -‘V1 

space  ft  {Q )  is  much  smaller  than  H  (0)  and  hence 

we  can  make  a  more  effective  choice  for  S,  if  we  use 

2  -'V1 

ft*(Q)  instead  of  H  (0). 
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Approximate  Eigenvalues  and  Eigenfunctions 


The  discretization  of  (8,11)  is  accomplished  by  selecting  the 

trial  and  test  spaces  S 1  h  and  S2  h  satisfying  (8.11)  -  (8.13), 

considering  the  finite  dimensional  eigenvalue  problem  (8.14),  and 

deriving  the  matrix  eigenvalue  problem  (8.15)  from  which  the 

approximate  eigenvalues  are  obtained.  The  selection  of  S„  .  and 

1 ,  h 

S2  h  is  the  most  important  part  of  this  process.  It  is  influ¬ 
enced  by  three  considerations. 

a.  The  spaces  h  and  S2  h  have  to  satisfy  (8.11)  and 

(8.12).  Note,  however,  that  if  the  problem  under  consi¬ 
deration  is  selfadjoint  and  positive  definite,  from  (8.32) 

we  see  that  (8.11)  and  (8.12)  hold  for  S.  .  =  S-  .  =  S.  , 

l  #  n  2  ,  n  n 

for  any  Sh>  Our  problem  (10.18)  is  self-ajoint  and  posi¬ 
tive  definite  and  we  will  take  S,  .  =  S„  .  =  S.  . 

X  f  n  2  f  ti  n 

b.  S2  ^  should  accurately  approximate  the  eigenfunctions  of 

(10.18)  and  S2  ^  should  accurately  approximate  the 

adjoint  eigenfunctions.  Usually  we  also  require  that  the 

rigid  body  motion  functions  are  included  in  S„  .  and 

1  ,  h 

Sg  h •  If  this  is  not  the  case,  then  we  have  to  assume 
that  the  rigid  body  motion  functions  are  very  well  approx¬ 
imated.  If  they  are  not  well  approximated,  although  there 
will  be  no  change  in  the  asymptotic  rate  of  convergence, 
the  accuracy  will  deteriorate,  especially  with  long 
domains  (such  as  long  beams),  for  which  the  rigid  body 
motions  for  some  parts  of  the  domain  could  be  relatively 
large . 


MW 


c.  The  matrices  A  and  B  in  (8.15)  should  be  reasonably 


sparse,  since  sparcity  is  strongly  related  to  computation¬ 
al  complexity.  Sparcity  is  achieved  by  choosing  finite 
element  spaces  for  S1  h  and  S2  These  spaces  then 

have  bases  consisting  of  functions  with  local  supports, 
and,  as  a  consequence,  A  and  B  will  be  sparse.  We 
note  that  the  sparseness  of  A  and  B  is  not  required 
for  the  validity  of  the  results  of  Section  8  and,  in 
fact,  in  certain  applications  one  does  use  non  finite 
element  type  trial  and  test  spaces,  spaces  consisting  of 
global  polynomials  or  trigonometric  polynomials,  e.g. 

We  now  describe  some  typical  choices  for  S^  for  the  L-shaped 
panel . 


The  h-Version  on  a  Uniform  Mesh 

Let  0  be  covered  by  a  mesh  of  uniform  squares 


of  size 


h  as  shown  in  Figure  10.2. 


Figure  10.2.  A  Uniform  Mesh  on  Q 
Then  for  p  =  1,2,...  let 

(10.20)  iP  =  (u  :  u  e  H1(Q) ,  u| 

i  j 

where 

(10.21)  ©(p)  =  {(m,n)  :  0  <  m,n,  and  m+n  <  p  or  (m,n)  =  (l,p) 

or  (p, 1 )  )  . 


X 


.  m  n 
A  x  y  , 
m ,  n 


(m,n)e-D (p) 


Spaces  of  this  type  are  said  to  be  composed  of  elements  of  type  Q 
(the  cases  p  =  1,2,3  are  discussed  in  Ciarlet  [1978]).  Basis 
functions  for  these  spaces  can  be  constructed  in  various  ways;  for 
example,  by  means  of  Lagrange  elements  (see  Ciarlet  [1978])  or  by 
use  of  hierarchial  elements. 

Regarding  the  approximation  properties  of  the  family  (sj^) 


it  can  be  shown  (see  Ciarlet  [1978])  that  if  k  >  1  is  an  integer 
then 


(10.22)  inf  II u  -  x\\1  Q  s  C(p)h^-1||u||lc  Q,  for  any  u  e  Hk(0) 

and  any  h  >  0, 

where 

(10.23)  n  =  min(k,p+l) 

and  C(p)  depends  on  p,k,  and  Q,  but  is  independent  of  u 
and  h.  (10.22)  is  optimal  in  the  sense  that  h^  on  the  right 
hand  side  cannot  be  replaced  by  a  higher  power  of  h  when  the 
mesh  is  uniform.  If  k  is  not  an  integer,  then  we  have 

(10.24)  inf  ||u  -  *||  *  C(p)h/i-1||u|| .. 

with  /i  given  by  (10.23).  Note  that  we  have  not  said  anything 

about  the  dependence  of  C(p)  on  p.  The  proof  in  Ciarlet  [1978] 

sugggests  that  C(p)  grows  with  p,  and  thus  could  lead  to  the 

conclusion  that  it  is  improper  to  use  p  >  k-1.  However,  this 

- ( k-1 ) 

conclusion  is  not  justified  because,  in  fact,  C(p)  <  Cp  ; 

see  Babuska  and  Suri  [1987b]. 

We  will  now  derive  (10.24)  -  (10.23)  from  (10.22)  -  (10.23) 
using  the  method  outlined  in  Section  2  (cf.  (2.12)  -  (2.16)  and 
Theorem  2.1). 

Suppose  m  <  k  <  m+1 .  Since 

Hull-.  =  sup  (t"€>K(u,t)), 

H  (Q)  0<t«» 

where  0  =  k-m,  we  see  that 

K(u,t)  <  t°  Hull  .. 


Let  f  >  0.  Then  for  any  t  >  0  there  exist  e  Hm(o)  and  w 


e  Hm+1(Q)  such  that  u  =  and 


|vt“«,0  +  t!lVm+l,0  5  K(u't|  +  £  5 

H  (U  ) 


+  £  . 


Therefore,  using  (10.22)  -  (10.23)  we  can  choose  *  .  e 

1  ^  Z  ,  X. 


so  that 


lvt  -  rl.t5l,0  s  Clp)^1  8vtl.>0  *  c(p)h"1  (te!!u» 

H  ) 


+  £  )  , 


where  =  min(m,p+l),  and 


i!wt  -  *2,t!!l.a  ‘  C(P'h,<2  5  'IP'"'*  ♦  .«) 

H  (U  ) 


where  I*  2  ~  min(m+l , p+l ) .  Letting  xt  =  *1  t  +  *2  t'  we  thus 


have 


^  i  —  i  a  f*  o  —  ^  a  _  i 

(10.25)  !|U  -  *  !|  o  5  C(p)(h  tw  +  h  *  t*  1 )  ( !|uH .. 

t  im  Hk(Q) 


+  €  )  , 


for  any  t  >  0. 


If  p  >  m,  select  t  =  h  in  (10.25)  to  obtain 


9+1*,- 1 

inf  !!  u  -  t!!  <  C(p)h  (!!u!! 

H  (0) 


+  f  ) 


«*g 


=  C(p)hk_1(;!U)) 


+  C  ) 


H  (0  ) 


=  C<  p)h"  1  ( Hull  -k  +  £  )  , 

UK  /  r\  \ 


H“(0  ) 


where  //  =  min(k,p+l).  If  p  <  m,  let  t  =  1  in  (10.25)  to  get 


li  -1 

inf  !u  -  r !!  -  n  '  C(p)h  ( !!  u !>  _ .  +  e) 

xeS*  H 


=  C(p)hp( :! ui!  +  c 

H  (O  ) 


C/5  i 

rr  tj 


in  these  estimates 


with  (j  =  min(k,p+l).  Letting  £ 

yields  (10.24)  -  (10.23). 

Now  define 


Hull.. 

H  (Q  ) 


(10.26) 


We  remark  that  the  rigid  body  motions  belong  to  Sh  (cf.  (10.14)). 
Since  (10.18)  is  selfajoint  and  positive  definite  and  satisfies 
(8.32),  we  see  that  (8.11),  with  0(h)  =  a,  and  (8.12)  hold. 
(10.22)  and  (10.24)  show  that  Sh  accurately  approximates  the  exact 
eigenfunctions.  Thus  (10.22)  and  (10.24),  together  with  a  density 
argument,  show  that  (8.13)  Is  satisfied.  If  an  appropriate  basis 
is  chosen  for  S^,  the  matrices  A  and  B  in  (9.15)  can  be  cal¬ 
culated  and  they  will  be  sparse.  Thus  the  issues  raised  above  in 
a.,  b. ,  and  c.  have  been  addressed. 

Now  consider  the  problem  (8.14)  with  this  choice  for  S,  . 

1 ,  h 

and  S2  h  and  denote  its  eigenvalues  and  eigenfunctions  by 


and 


0  <  "i,h  * 


<  Ci> 


N ,  h 


(ul,h'vl,h} 


(UN,h'VN,h) 


where  N  =  dim  S^.  To  assess  the  accuracy  of  these  approximate 
eigenpairs,  the  results  of  Section  8  will  be  applied.  All  of  the 
hypotheses  for  these  results  have  now  been  shown  to  be  satisfied 
for  our  problem  and  approximation  procedure. 


Theorem  10.1.  Let  S,  .  and  S„  .  be  selected  as  in  (10.26). 

l  f  n  2 1  n 

Suppose  is  an  eigenvalue  of  (10.18)  with  multiplicity  q. 


i.e.,  suppose  ^  <  Wfc  -  «k+1  =  ...  =  <  <.>k+q.  Then 

(10.27)  h  -  s  C ( p) h1 ‘ 088962  ,  j  =  k . k+q-1. 

If  (w,  ,  z.  )  =  (u.  .  ,v.  .  ),  j  =  k,..., k+q-1,  then  there  is  a  unit 
n  n  j  <  ^  j » n 

eigenfunction  (u,v)  -  (u^,v^)  of  (10.18)  such  that 

544  48 1 •  •  • 

(10.28a)  ;i u  -  whil1Q  +  || v  -  zhll1<0  *  C(p)h,04**Bi 


and  tf  (u,v)  is  a  unit  eigenfunction  of  (10.18)  corresponding  to 


«k,  then  there  is  a  unit  vector  (wh'zh^  r 


sp{  (u,  .  ,  v.  .),...,  (u.  .  ,  .,v.  ,  ,  .  )}  such  that 

k,h  k,h'  k+q-1, h  k+q-1, h  ' 


(10.28b)  || u  -  whH  1  Q  +  || v  -  z 


h" 1 ,Q  5  C(p)h 


.544481 


If  «k  is  simple,  the  eigenfunction  estimates  reduce  to 


(10.280  ||ukjh  -  ukll1>Q  +  Hvk(h  -  vkHlfQ  *  C(p)h 


544481 


*f70  +  l 

Proof .  We  saw  in  Subsection  1')  that  u^  and  are  in  H 


with  oQ  =  .544481- Thus  from  (10.22)  -  (10.24)  we  have 


inf 


h  -  sup  inf  l!(u.,v.)  -  (*.,*_)  II  .  . 

j-k . k+q-l  r=(t1,t2)^sh  J  J  Ha(0)vHa(0) 


«  C ( p ) h  . 


(10.27)  and  (10.28)  follow  from  this  estimate  and  (8.44)  -  (8.46; 


o 


To  show  the  effectiveness  of  estimates  (10.27)  -  (10.28)  we 
would  have  to  know  the  exact  eigenfunctions  and  eigenvalues. 
Because  these  are  not  available  we  consider  instead  the  quantity 

*  *  *  * 

inf  a ( u  -x  ,v  -x . , u  -r  .  v  -t  ) , 

*=(*,. -r9KS. 


(10.29)  Q(p.h) 


where  u  and  v 


are  given  in  (10.19).  Q(p,h)  can  be  computed 


numerically.  Figure  10.3  shows  the  graph  of 

»'»E.R  *  [‘^•hl/a(u\v’:u\v*,]1''2 

as  a  function  of  h,  for  various  values  of  p.  H e II E  R  is  the 

relative  error  in  the  energy  norm  measure  of  the  degree  to  which 
*  * 

(u  ,  v  )  can  be  approximated  by  functions  in  S^.  The  graph, 
which  is  plotted  in  log-log  scale,  is  a  straight  line  and  thus 

iiei,e,r  =  Ch°, 

where  a  is  the  slope  of  the  line.  We  see  that  the  slope  is  very 
close  to  the  theoretically  predicted  a  =  . 544481--.  Increasing 
p  decreases  the  constant  C  but  does  not  affect  the  slope  a. 

From  an  analysis  of  Figure  10.3  we  can  draw  several  conclu¬ 
sions  : 

•  To  achieve  an  accuracy  of  5*  (respectively,  3%)  with 
elements  of  degree  p  =  1  we  would  require  N  to  be  about  25,000 
(respectively,  N  to  be  about  170,000)  and  with  elements  of 
legree  2  we  would  require  N  to  be  about  19,000  (respectively, 

N  to  be  about  124,000).  This  shows  that  a  uniform  or  quasiuni 
form  mesh  is  completely  unacceptable  for  our  problem. 

•  Because  the  rate’  of  convergence  for  eigenvalues  is  twice 
that  for  eigenfunctions,  we  see  that  the  eigenvalues  are  much 
cheaper  to  compute  than  the  eigenfunctions.  Roughly  speaking,  we 
see  that  for  eigenvalue  calculations  the  required  number  of  un¬ 
knowns  would  be  about  N  *  160  (respectively,  about  N  =  400)  for 
p  =  1  and  about  N  =  140  (respectively,  N  -  350)  for  p  =  2. 


RELATIVE 
IN  ENERGY  N 


Figure  10.3.  The  Relative  Approximation  Error  Measured  in 
the  Energy  Norm.  The  h-V<»rsion. 


The  p-Version 


In  the  h-version  of  the  finite  element  method  accuracy  is 
achieved  by  letting  h^  0 ,  while  p  is  held  fixed.  In  the  p- 
version  of  the  finite  element  method,  one,  in  contrast,  fixes  h 
and  lots  p/"®. 

Let  again  be  defined  by  (10.20)  -  (10.21).  Regarding 

the  (p-version)  approximation  properties  of  the  family 

(Sh  ,  it  can  be  shown  that  if  u  =  u.  +  u0,  where  u  e 

n  p  i  f  c. ,  .  »  •  l  t  ± 

1c  o 

H  (0),  with  k  ^  2,  and  u2  =  Kr  F(0),  with  o  >  0,  then 

(10.30)  inf  I; u  -  x\\lfQ  s  c(h)[Kp_2cr  +  p-(k'1)i|u1i;k  . 


We  remark  that  in  (10.30)  it  is  essential  that  the  origin  of  ft 
lies  on  an  element  vertex;  for  in  this  case,  the  estimate  for  u2 
is  of  twice  the  order  as  would  be  obtained  if  we  based  our  esti¬ 
mate  on  the  assumption  that  u2  e  Ha+1  and  used  the  h-version 
with  a  uniform  mesh.  For  a  proof  of  (10.30),  see  Babuska  and  Suri 
[ 1987a] . 


Define 
( 10.31 ) 


S 

P 


Then  (8.11),  with  (1  ( h)  =  a,  and  (8.12)  are  satisfied.  (10.30) 
shows  that  Sp  accurately  approximates  the  exact  eigenfunctions 
and  thus  that  (8.13)  is  satisfied.  We  see  that  the  issues  raised 


in  a.,  b.,  and  c.  have  been  addressed.  In  connection  with  c., 
however,  we  observe  that  the  matrices  A  and  B  are  less  sparse 
than  with  the  h-version.  Note  that  the  parameter  p,  which 
approaches  ®,  is  here  playing  the  role  of  the  parameter  h  in 


Section  8,  which  approached  0. 


Now  consider  the  problem  (8.14)  with  this  choice  for  S 
and  S  and  denote  the  eigenvalues  and  eigenfunctions  by 

* ,  p  ' 


1  ,P 


0  <  cj,  < 
1  .P 


<  (O 


N,p 


and 


(Ul,p'Vl,p) . (UN,p'VN,p) ' 


where  N  =  dim  S^.  As  with  the  h-version,  the  accuracy  of  the 
approximate  eigenpairs  may  be  assessed  with  the  results  of  Section 
8. 


Theorem  10.2.  Let  S,  _  and  S„  _  be  chosen  as  in  (10.31). 

1  ,  P  <-  /  P 

Suppose  is  an  eigenvalue  of  (10.18)  with  multiplicity  q. 

Then 


-2  177924- • • 

(10.32)  lUj  -  okl  *  C  ( h )  p  t-1"***  ,  j  =  k . k+q-1, 

and 


(10.33) 


lu 


k ,  p 


-  u 


k"  1 ,0 


k,  p 


-  v 


k"  1 ,0 


C(h)p 


-1 . 088962 


Note  that  we  have  given  the  eigenfunction  estimate  the  simplified 
form  it  has  when  is  simple;  it  would  have  to  be  modified  if 

has  multiplicity  greater  than  1.  See  the  statement  of 
Theorem  10.1. 


Proof.  Suppose  has  multiplicity  q  and  let  w  be  either 

component  of  one  of  the  eigenfunctions  corresponding  to  c->k .  We 

have  seen  that  w  can  be  written  in  the  form  w  =  w  +  w  ,  where 

2k.  2 

w  e  H  (0) ,  with  k  >  3,  and  w  is  a  sum  of  terms  of  the  type 

(10.19)  with  n  ■>  a  and  with  coordinate  centers  at  the  vertices 

of  Q.  Because  a  =  .544481-  in  (10.19),  from  (10.30)  we  have 


F  < 

P 


C(h)p 


-1 .088962-  •  • 


(10.32)  and  (10.33)  follow  from  this  estimate  and  (8.44)  -  (8.46) 


f 


To  illustrate  the  performance  of  the  p-version  we  consider, 

as  with  the  h-version,  the  relative  error  in  the  energy  norm  mea 

*  * 


sure  of  the  degree  to  which  (u  ,v  )  can  be  approximated  by  S 

P 

(cf.  (10.29)).  Figure  10.4  presents  the  graph  of  <| e(*  as  a 

c  /  R 

function  of  p,  for  various  values  of  h.  Again  the  log-log 
scale  is  used.  We  see  that  the  slope  is  close  to  the  theoretical 
ly  predicted  -1.088962- • • .  This  is  valid  only  for  p  >  3,  but 
recall  that  all  our  results  are  of  an  asymptotic  nature. 
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IN  EN 


DEGREE  p  OF  ELEMENTS 


1 

si 


Figure  10.4.  The  Relative  Approximation  Error  Measured  in  the 

Energy  Norm.  The  p-Version. 


To  assess  the  relative  effectiveness  of  the  h-  and  p- 
versions,  to  understand,  in  particular,  their  dependence  on  the 


choice  of  S.  and  S  is  not  easy.  Here  we  content  ourselves 
h  p 

with  a  brief  assessment  in  terms  of  the  number  of  degrees  of  free 

dom:  N  =  2  dim  =  dim  S,  =  dim  S  .  In  Figure  10.5,  the  rela- 

h  h  p 

tive  error  in  the  energy  norm  measure  of  the  accuracy  is  plotted 

-2  2 

as  a  function  of  N.  Since  N  =  h  and  N  =*  p  ,  the  rates  of 
convergence  shown  in  Figure  10.5  are  half  those  shown  in  Figures 
10.3  and  10.4.  We  see  that  with  respect  to  degrees  of  freedom, 
the  p-version  with  h  =  1/2  performs  better  than  the  h-version 
with  p  =  1 , 2 , 3 ,  or  4 . 


The  h-p-Version 

In  this  version  of  the  finite  element  method  accuracy  is 
achieved  by  simultaneously  decreasing  the  mesh  size  h  and  in¬ 
creasing  the  polynomial  degree  p.  We  here  distinguish  various 
cases.  The  major  ones  are: 

i.  Uniform  mesh  and  uniform  p  distribution,  (i.e.,  the 
polynomial  degree  p  is  the  same  on  each  mesh  sub- 
domain,  i.e.,  element); 

ii.  Refined  (non  uniform)  mesh  and  uniform  p  distribution; 

and 

iii.  Refined  mesh  and  selective  increase  of  degree  p. 


We  will  now  elaborate  on  cases  i.  and  ii.  Case  i.  obviously  com¬ 
bines  the  h-  and  p-versions  discussed  above.  In  this  case  one 
has 


Theorem  10.3.  Let  S.  .  =  S.  ,,  . 

-  1 , (h,p)  2 , (h,p) 


=  S 


,,  .  =  SP*SP  and 

( h ,  p )  h  h 


let  <,),  .  and  (u.  ,,  .  ,v,  .)  be  the  associated  eigen- 

k,(h,p)  k,(h,p)  k, (h,p)  y 

values  and  eigenfunctions.  Suppose  is  an  eigenvalue  of 

(10.18;  with  multiplicity  q.  Then 


(10.34) 


|wj,(h,p)  -  wk'  *  C<min 


crQ  hmin{a0’p-a0N  2 


So, 


}  ,  j  =  k, . . . , k+q-1 , 


and 


(10.35) 


luk,(h.p)-Vi.o  +  l!vk,(h,p)-Vi,o  !  C("ln 


hmin«.  ,p-o  )-, 

0 


2  a , 


where  =•  .544481 


and  C  is  independent  of  h  and  p. 


Proof .  The  basic  approximation  results  for  this  type  of  approxima 


tion  were  proved  by  Babuska  and  Suri  [1987b],  (10.34)  and  (10.35) 


follow  directly  from  these  results  and  (8.44)  -  (8.46).  □ 

In  case  ii.  we  will  consider  only  geometric  meshes  with  ratio 

factor  .15;  see  Figure  10.6.  This  ratio  is  close  to  optimal. 

The  space  S(.  .  =  is  now  more  complicated.  Sr;  is  defined 

( h ,  p )  h  h  h 

by 

«  1 

=  (u  £  H  (Q)  ;  u|T  is  the  image  of  a  polynomial 

ij 

in  a  square 

S  =  {(K  ,T])  :  |$  I  ,  I'M  *  1) 

or  a  triangle 

T  =  {  (K  )  i0snsK,0iKsl), 

for  all  subdomains  I _  in  the  mesh) . 

For  a  more  detailed  description  of  see  Babuska  and  Guo 

[1987b,  c]  and  Szabo  [1986],  For  a  thorough  discussion  of  the 
h-p-version  in  the  one  dimensional  setting,  we  refer  to  Gui  and 
Babuska  [1986]. 

Figure  10.7  shows  the  performance  of  the  p-version  on  meshes 
with  various  numbers  of  layers  n,  as  well  as  the  performance  of 
the  p-  and  the  h-versions  for  uniform  meshes.  We  typically  see 
a  reverse  S  curve  for  the  accuracy  of  the  p-version  on  a  geo¬ 
metric  mesh.  The  first  part  of  the  curve  is  convex  and  then  it  is 
concave,  the  slope  approaching  N  The  h-p-version  appears  as 

the  envelope  of  the  p-version  on  geometric  meshes  with  various 
numbers  of  layers.  This  envelope  shows  the  optimal  relation  be¬ 
tween  the  number  of  layers  and  the  polynomial  degree.  In  Babuska 
and  Guo  [1987b],  it  is  shown  that  if  u  <? 


with  0  <  !i  <  1  , 
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Figure  10.7.  The  Relative  Error  in  the  Energy  Norm  in 


Dependence  on  N  for  Various  Meshes 
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Using  this  approximation  result  and  the  results  of  Section  8,  we 
obtain 


Theorem  10.4.  Suppose  the  components  u^  and  of  the  eigen- 

2 

functions  belong  to  (in  our  case  =  .544481-  ••  +  r.  )  .  Then 

with  a  proper  choice  of  geometric  mesh  and  the  degree  p  we  have 


( 10.36) 


|wk.P  -  v 


<  c.-2^ 


(10.37) 


I!Vp  -  Vl.O  +  !!vk,p  -  Vl.O  ‘  Ce^VB- 


where  a  depends  on  the  ratio  of  the  mesh,  the  relation  of  p 
and  the  number  of  layers,  and  the  domain,  but  is  independent  of 


Proof .  (10.36)  and  (10.37)  follow  directly  from  the  results  of 


Section  8  and  the  above  estimate  for  i!  e(! 


E ,  R  ’ 


Figure  10.7  clearly  shows  the  effect  of  the  proper  selection 
of  meshes  and  element  degrees  on  the  effectiveness  of  the  finite 
element  method.  It  also  shows  that  the  optimal  choice  depends  on 
the  required  accuracy.  The  design  of  the  mesh  and  selection  of 
the  degree  of  the  elements  is  a  delicate  task.  Various  approaches 
to  deal  with  this  problem  are  in  the  research  phase.  One  promis¬ 
ing  approach  is  to  apply  the  principles  of  artificial  intelligence 
(expert  systems).  For  further  details  we  refer  to  Babuska  and 

Rank  [1987].  Figure  10.7  shows  only  the  dependence  of  the  accu¬ 


racy  on  the  number  of  degrees  of  freedom  N.  It  is  also  essential 
to  judge  the  complexity  of  the  method  with  respect  to  such  factors 
as  number  of  operations,  computer  architecture,  user's  interaction 
etc.  For  a  detailed  study  of  computer  time,  accuracy,  and  perfor- 


mance  for  various  numbers  of  degrees  of  freedom,  we  refer  to 
Babuska  and  Scapolla  [1987].  We  can  see  directly  from  Fig  10.7 
that  the  proper  mesh  design  leading  to  an  accuracy  of  5%  has  2 
layers  (the  ratio  of  the  sizes  of  elements  is  of  order  50)  and 
p  =  3.  For  an  accuracy  of  3$,  the  optimal  p  is  3  or  4  and 
the  number  of  layers  is  2  or  3  (which  leads  to  size  ratios 
from  50  to  300).  The  number  of  degrees  of  freedom  is  200  -  300 
(compared  with  25,000  -  170,000  for  a  uniform  mesh  and  p  =  1). 

3)  Solution  of  the  Matrix  Eigenvalue  Problem. 

We  have  seen  that  the  approximation  procedure  developed  in 
Section  8  leads  from  the  eigenvalue  problem  (8.10)  or  (10.18)  to 
the  generalized  matrix  eigenvalue  problem  (8.15),  and  that  the 
matrices  A  and  B  in  (8.15)  are  sparse  if  the  bases  for  the 
trial  and  test  spaces  are  properly  chosen.  From  the  error  esti¬ 
mates  in  Section  8  we  know  that  the  low  eigenvalues  of  (8.15) 
(approximately  10%  of  them)  give  reasonable  approximations  to 
the  exact  eigenvalues.  In  fact,  we  have  proved  convergence  for 
each  fixed  eigenvalue,  but  convergence  does  not  occur  for  a  fixed 
percentage  of  the  available  eigenvalues.  If  A  and  B  have 
dimension  N,  then  N  approximate  eigenvalues  are  available,  but 
W[aN]  h — >w[aN]  as  N — * 00 '  where  0  <  a  <  1  and  [aN]  is  the 
integral  part  of  aN.  The  10%  figure  we  mentioned  above  is 
related  to  engineering  accuracy  and  practice  only.  The  size  of 
the  matrix  problem  will  thus  be  much  larger  than  the  number  of 
eigenvalues  we  are  attempting  to  calculate.  The  matrix  eigenvalue 
solver,  a  crucial  component  of  the  complete  computational  proce¬ 
dure,  should  therefore  be  designed  to  effectively  find  the  low 


eigenvalues  of  large,  sparse,  generalized  matrix  problems.  An 
appropriate  version  of  the  Lanczos  algorithm  is  suitable  for  this 
class  of  matrix  problems  and  is  often  used  in  practice.  We  refer 
to  the  monographs  by  Parlett  [1980]  and  Cullum  and  Willoughby 
[1985].  Because  the  extraction  of  the  eigenvalues  is  very  expen¬ 
sive,  various  "tricks"  are  used  in  engineering  practice  to  reduce 
the  sizes  of  the  matrices  under  consideration.  We  will  not  go 
further  in  this  direction. 

Remark  10.2.  It  should  be  emphasized  that,  generally,  the  goal  of 
the  computation  is  to  find,  in  addition  to  the  eigenpairs,  certain 
functionals  of  the  eigenfunctions  (u,v)  —  e.g.,  the  stress  inten¬ 
sity  factors,  which  are  combinations  of  the  derivatives  of  u  and 
v.  We  will  not  pursue  this  direction  since  it  lies  beyond  the 
scope  of  this  article.  We  refer,  e.g.,  to  Babuska  and  Miller 
[1984]  and  Szabo  and  Babuska  [1986]. 

Remark  10.3.  The  complete  computational  resolution  of  an  eigen¬ 
value  problem  is  influenced  by  a  wide  range  of  factors.  Some  of 
the  most  important  of  these  —  the  smoothness  of  the  eigenfunctions 
and  the  approximation  properties  of  the  trial  and  test  space,  e.g. 
—  have  been  discussed  in  detail.  Others  —  the  accuracy  of  the 
matrix  eigenvalue  solver  and  the  relation  between  the  accuracy  of 
the  matrix  solver  and  the  error  ^  e.g.  —  have  not  been 

mentioned  or  have  only  been  mentioned  briefly.  While  these  latter 
factors  are  important,  we  will  not  be  able  to  pursue  them.  We 
also  note  that  the  important  function  of  a  posteriori  analysis  of 
computed  data  has  not  been  discussed.  Likewise  we  have  not  dis¬ 
cussed  any  adaptive  approaches.  For  some  ideas  on  the  assessment 


w 


of  the  quality  of  the  finite  element  computations  we  refer  to  the 
survey  paper  of  Noor  and  Babuska  [1987]. 


B.  Vibration  of  a  Membrane 

We  consider  here  the  eigenvalue  problem  associated  with  the 
small,  transverse  vibration  of  a  membrane  stretched  over  a  bounded 
region  ft  in  the  plane  and  fixed  along  its  edge  r  =  dft ,  i.e., 
the  eigenvalue  problem 


( 10.38) 


-A  u  =  Xu,  in  ft 


u  =  0  on  f 


(cf.  Subsection  I.B.,  in  particular,  (1.27)).  We  turn  now  to  a 
discussion  of  the  basic  steps  1),  1'),  2),  and  3)  (cf.  Subsection 
A.  above)  in  the  finite  element  approximation  of  the  eigenpairs  of 
(10.38).  The  discussion  can  be  brief  since  these  steps  are  simi¬ 
lar  for  the  two  problems  (10.1)  -  (10.2)  and  (10.38),  in  fact  for 
any  eigenvalue  problem. 

1 )  Variational  Formulation 

(10.38)  is  a  special  case  of  problem  (3.1)  and  the  variation¬ 
al  formulation  (3.18)  of  (3.1)  was  derived  in  Section  3.  Thus  we 
see  that  the  variational  formulation  of  (10.38)  is  given  by 


( 10.39) 


'u  e  H*(ft  )  ,  u  *  0 


fdu  dv  du  dv) 

(.55  55  +  cT?  d7jdXdY  = 


uv  dx,  V  v  e  Hq (ft 


fdu  dv  du  dvl  ,  ,  f  _  _  ,  _ 

a(u,v)  =  [d5  d5  +  7^  dV|dxdy  =  7u‘7v  dxdy 

0  '  ft 


<«*/ 


be  defined  for  u,v  e  H  =  HQ(Q),  and  let 


b ( u , v )  =  uv  dx 


be  defined  for  u,v  e  w  =  L2(Q).  Then  (10.39)  has  the  form  of 
(8.10),  and  a  and  b  are  symmetric  forms,  (8.1),  (8.7),  (8.32), 

and  (8.33)  are  satisfied,  and  H  c  w,  compactly.  All  of  this  can 
be  easily  seen  for  the  concrete  problem  we  are  considering;  it  also 
follows  from  the  more  general  discussion  in  Section  3.  (10.39)  is 

a  selfadjoint,  positive  definite  problem.  It  has  eigenvalues 

0  <  \  ^  <  \  2  5  .../"-wo 
and  corresponding  eigenfunctions 


, u2 , • • • , 


which  can  be  chosen  to  satisfy 


Vu.-Vv.  dxdy  =  \  .  u . u  .  dxdy  =  6 .  .  . 
i  J  j  i  j  ij 

0  Jo 


1 ' )  Regularity  of  the  Eigenfunctions 

From  Theorems  4.1  -  4.4  we  obtain  the  following  regularity 
results  for  the  eigenfunctions  u.  of  (10.39)  (or  (10.38)). 

lr 

• )  For  k  >  2,  if  r  =  30  is  of  class  C  ,  then  u^  e 
Hk(0 ) . 

•)  If  T  is  of  class  C°°,  then  u^  e  C°° (Q )  . 

•  )  If  F  is  analytic,  then  u^  is  analytic  in  Q. 

• )  If  Q  is  a  curved  polygon  with  analytic  sides  and  with 

vertices  A_,A,,...,  then  u.  is  analytic  in  0  -  UA . . 

0  1  i  j 

u.  is  singular  at  the  vertices;  the  strengths  of  the 


singularities  depend  on  the  interior  vertex  angles. 

2 

Moreover,  u  e  for  properly  chosen  (i . 

2 )  Discretization  of  (8.10)  and  Assessment  of  the  Accuracy  of  the 
Approximate  Eiqenpairs 

Suppose  0  is  a  polygon.  By  a  triangulat ion  or  mesh  on  0 
we  will  mean  a  finite  family  r  =  satisfying 

•  each  1\  is  a  closed  triangle, 

M(t) 

•  0  -  U  . 

i=l 

•  for  any  and  Tj  e  T '  Ti  n  Tj  =  0  or  a  common  vertex 

or  a  common  side. 

For  o  <  a  <  n,  a  triangulation  r  is  said  to  be  o-regular  if 
the  minimal  angle  of  every  triangle  T  e  r  is  greater  than  or 
equal  to  a.  For  any  r,  let 

h  =  h(r)  =  max  diam  T. 

i-1 .... >M{t ) 

and 

h ( r )  =  min  diam  T . . 

i  =  l _ ,M(r)  1 

r  is  said  to  be  q-quasiunif orm  if 


We  will  often  view  triangulations  T  =  as  Par^meter ized  by  h 

=  h(r)  and  consider  families  y  =  {r}  =  (T^)  of  triangulations 
that  are  a-regular.  An  example  of  a  n/4-regular ,  1 -quasiuni form 


triangulation  of  the  domain  Q  =  {(x,y)  :  -1  <  x  <  1,  -1  <  y  <  1} 
is  shown  in  Figure  10.8.  It  is  called  a  uniform  triangulation. 


Figure  10.8.  A  Uniform  Triangulation. 


For  t  a  triangulation  of  ft  and  p  =  1,2,...  let 

p  1  i 

SMt)  =  (u  :  u  e  H  <0 )  :  u|T  *  a  polynomial  of  degree 

for  each  Ter} 

and  let 

Sq  (  t  )  =  SP(T)  n  hJ(Q)  . 


Regarding  the  approximation  properties  of  SP(r)  and  S 
k  >  1  (integer  or  non  integer)  and  p  >  1  and  if  y  =  (t ) 
family  of  a-regular  triangulations  of  0,  then 


( 10.40a) 


inf  II  u-t:  II  j  q  <-  ■ 

|fSP(T)  P 


k,o ' 


for  any  u  e  H  (ft)  and  for  any  r  e  y  , 


( 10.40b) 


inf  Ku-tl!  <  Ch<^;  Hull  , 
<=P, .  .  P 


re  sP  (  « 


ic  1 

for  any  u  e  h  (ft)  n  HQ(ft)  and  for  any  t  c  / , 


.  r.  ■'  #'  * .  < .  «  -r.  •  v  • 


a  o 


where 


(10.41)  n  =  min(k,p+l). 

The  constant  C  in  (10. 40a, b)  is  independent  of  p,  r,  and  u, 
but  depends  on  0,k,  and  a.  For  a  complete  proof  of  these  esti¬ 
mates  we  refer  to  Babuska  and  Suri  [1987b]. 

Now  define 


(10.42) 


1, (h.p) 


=  S 


2,  (h,p) 


=  S 


(h,p) 


=  Sq  ( t  ) 


Since  (10.39)  is  selfadjoint  and  positive  definite  and  satisfies 
(8.32),  we  see  that  (8.11)  and  (8.12)  are  satisfied.  (10.40b) 
shows  that  S ^  p^  accurately  approximates  the  exact  eigenfunc¬ 
tions  and  thus  that  (8.13)  is  satisfied.  If  a  suitable  basis  is 
chosen  for  S,.  .  ,  then  the  matrices  A  and  B  in  (8.15)  will 

be  sparse.  The  issues  raised  in  a.,  b. ,  and  c.  in  Subsection 
A. 2)  above  have  now  been  addressed  for  this  choice  for  S,,  .  . 

Note  that  in  using  the  notation  we  are  identifying  h  = 

h(r)  with  r.  An  alternate,  and  more  explicit,  notation  would  be 

S(t ,p) * 

Now  consider  problem  (8.14)  with  S,.  .  defined  as  in 

(h,  p) 

(10.42)  and  denote  the  approximate  eigenvalues  by 


Xl, (h.p)  5  ’ • ’  4  XN, (h,p) 


and 


U1 ,  (  h ,  p )  . UN ,  (  h ,  p )  ' 


where  N  =  dim  j.  To  assess  the  accuracy  of  these  approxi¬ 

mate  eigenpairs,  we  apply  the  results  of  Section  8,  obtaining 


Theorem  10.5.  Let  S(,  .  be  selected  as  in  (10.42)  and  suppose 

-  (h,p) 


X^  is  an  eigenvalue  of  (10.39)  with  multiplicity  q.  Then 


( 10.43) 


(10.44) 


U<,(h,p)  V  s  Cp2k-2 '  e  "  j . 3+q  *' 


|uj,  (h.p)  -  uj"i.Q  s 


where  k  s  1  is  such  that  the  eigenfunctions  corresponding  to  X^ 
are  in  H  (Q )  and  /j  =  min(k,p+l).  Note  that  we  have  given  the 
eigenfunction  estimate  the  simple  form  it  has  when  X^.  is  simple; 
it  would  have  to  be  modified  in  the  general  case.  See  the  state¬ 
ment  of  Theorem  10.1. 

Proof.  Suppose  X^  has  multiplicity  q.  Then  (10.39)  has  eigen¬ 
functions  Uj , . . . iUj+g_1 ,  associated  with  X^;  by  assumption, 

Jr 

these  eigenfunctions  are  in  H  (0 ) .  Thus,  by  (10.40)  -  (10.41), 


we  have 


(10.45) 


£h  = 


• • • . J+q-i  *eS 


inf  llu<-*lllfQ 


(h,p) 


hH-l  n-l 

*  ,  Hmax  "Vk.o  =  cnFI- 

p  <■-  j  ,  •  .  .  ,  j+q-l  p 

(10.43)  and  (10.44)  follow  directly  from  this  estimate  and  (8.44) 

-  (8.46).  n 

Remark  10.4.  If  our  membrane  is  free  instead  of  fixed  along  its 
edge,  then  we  would  have  considered  the  Neumann  boundary  condition 

—  =0.  In  this  situation  the  eigenvalue  problem  would  have  the 
variational  foundation 


( 10.46) 


u  e  H,  u  p*  0 
a(u, v)  =  Xb(u,v) , 


where  a  and  b  are  as  above,  but 


H 


{u  :  u  e  H1(0) , 


udxdy  =  0 } , 


JG 

I|U|IH  =  11  Ul1  l.Q* 


We  would  choose 


S(h,p) 


(u 


u  e  SP(r) , 


udxdy  =  0} 
Q 


for  the  trial  and  test  space.  Then  all  of  the  hypotheses  in  Sec¬ 
tion  8  are  satisfied,  approximation  results  similar  to  (10.40)  can 
be  proved,  and  for  the  approximate  eigenpairs,  the  error  estimates 
(10.43)  and  (10.44)  follow.  We  note  in  particular  that  the 
Neumann  boundary  condition  is  only  implicitly  stated  in  (10.46), 
i.e.,  is  natural,  and  thus  that  the  boundary  condition  need  not  be 
imposed  on  the  trial  and  test  functions.  This  fact  makes  imple¬ 
mentation  easier,  especially  for  domains  with  curved  boundaries. 
See  the  discussion  of  natural  and  essential  boundary  conditions  in 
Section  3. 


3)  Solution  of  the  Matrix  Eigenvalue  Problem 

The  comments  made  in  Subsection  A. 3)  apply  here  as  well. 
Multiple  Eigenvalues 

The  results  proved  in  this  Subsection  and  in  Subsection  A. 
cover  the  case  of  multiple  eigenvalues.  Recall  that  the  estimates 


tor  |Xj.(h.p)  -  V  and  lluj.(h,p>  '  uj81.0  are  in  terms  of 


'h  " 


max 


inf 


<=1 . q 


1 . (h,p) 


%  '  *»l.0’ 


where  q  is  the  multiplicity  of  *  .  and  u,  ,...,u  are  the 


corresponding  eigenfunctions.  We  now  make  some  comments  on  mul¬ 
tiple  eigenvalues  and  then  make  an  application  of  the  refined  error 
estimates  for  multiple  eigenvalues  proved  in  Section  9. 


The  eigenvalues  and  eigenfunctions  of  the  membrane  problem  on 
a  square,  i.e.,  the  problem 


(10.47) 

where 


-A  u  =  A.  u  on  0 

•< 

u  =  0  on  r , 

Q  =  { (x, y)  :  I x| , |y |  <  n) , 


are  easily  seen  to  be  given  by 

1  =  k2  +  -£2 

k,^  K 

and 

ufc  t  -  sin  kx  sin  -Cy,  k,<  ■  1,2,  .  .  .  . 

Hence  v;e  see  that  there  are  multiple  eigenvalues.  (10.47)  is 
typical  of  problems  with  symmetries  ((10.47)  is  symmetric  with 
respect  to  x  and  y) ,  and  we  thus  see  that  multiple  eigenvalues 
are  common  in  applications. 

For  i  =  1,2,...,  let  X.  be  an  eigenvalues  of  (10.47)  of 

Ki 

multiplicity  qi,  i.e.,  suppose 

*kA-l  <  Xk.  =  XkA+l  =  •••  =  Aki+qi-l  <  Xki+q1  <  Xk.+1- 

Note  that  we  are  here  using  the  notation  introduced  in  Section  10, 

whereby  =  1 '  k2  is  the  lowest  index  of  the  second  distinct 

eigenvalue,  etc.  Suppose  now  that  q.  >  1,  i.e.,  that  X.  is 

i  xi 

multiple.  Let  (S^)  be  any  family  of  finite  dimensional  sub¬ 
spaces  of  H^(0)  satisfying  (9.14).  Recall  from  Section  7  that 


Xki,h'  -  Xki+qi-i,h 

converge  to  X.  .  While  these  approximate  eigenvalues  may  be 
Ki 

equal,  i.e.,  we  may  have  one  distinct  eigenvalue  with  multiplicity 

q. ,  consideration  of  the  situation  in  which  we  choose  S,  to  be 
i  n 

S^t),  where  r  is  the  triangulation  shown  in  Figure  10.8.,  shows 
that  they  may  not  be  equal,  since  some  of  the  symmetries  present 
in  (10.47)  are  not  present  in  the  discrete  problem.  Nevertheless, 
Theorem  10.5  provides  estimates  for  each  of  the  errors 


'"k^j-l.h1 

mates  are 


k1+j-l1,  i  1 


,q^.  As  we  have  seen  the  esti- 


|Xki+j-l,h  Xk1+j-l 


s  Cc 


(10.48) 


C  [  sup  inf  llu-*ll1Qr, 

ueM(X  )x*Sl(r) 

Ki 

j  =  i . q^ 


which  suggest  that  the  error  in  X.  .  depends  on  the  degree 

fl  ,  K^*r  J  1 

to  which  S1 (t )  can  approximate  all  of  the  eigenfunctions  corres¬ 
ponding  to  X.  . 

Ki 

Recall  that  in  Section  9  (Theorem  9.1)  we  proved  refined 
estimates,  namely, 

(10.49)  ,Xki+j-1,h“Xki+j_1!  «  C£i,j(h),  J  =  l,...,qi. 


c i  j(h)  =  inf  inf 

ueM(X,_  )  tesV) 


where 


“I“,'*k1,hl 


ki+j-2,h/ 

Now  for  the  specific  problem  (10.47),  all  eigenfunctions  have  the 
same  smoothness  properties  and  S1(t)  with  r  given  in  Figure 
10.8,  will  approximate  them  all  with  the  same  asymptotic  accuracy 
and  (10.48)  and  (10.49)  would  each  lead  to  the  same  estimate  in 
terms  of  h.  The  multiplicative  constants  in  the  estimates  could, 
however,  be  different.  We  further  note  that  there  are  eigenvalue 
problems  for  which  the  different  eigenfunctions  corresponding  to  a 
multiple  eigenvalues  have  strikingly  different  approximability 
properties.  For  such  problems  (10.49)  would  provide  a  striking 
improvement  over  (10.48). 

As  an  example  of  such  a  problem,  consider 

~  (^"IxT  u#(x)]  =*<P'(x)u,  xe  I  =  {-n  ,n  )  , 

(10.50)  -u(-w)  =  u(w), 

U'j(-Ti)  =  U'j(tr), 

* 

where 

<p(x)  =  n  a|x|i+asgn  x,  0  <  a  <  1. 

It  is  easy  to  check  that  the  eigenvalues  and  eigenfunctions  are  as 
shown  in  Table  10.1. 


Table  10.1 

Eigenvalues  and  Eigenfunctions  of  the  Eigenvalue  Problem  (10.50) 


i 

Xi 

u . 

l 

0 

0.0 

1 

1 

1.0 

cos  <p  ( x) 

2 

1.0 

sin  <p  (x) 

3 

4.0 

cos  2 <p(x) 

4 

4.0 

sin  2<p  ( x) 

: 

| 


We  see  that  \1  =  X^,  Xg  =  X 4#  etc. 


We  cast  this  problem  into  the  variational  form  (10.1)  by 
choosing 


With  these  choices  (8.10)  is  equivalent  to  (10.50),  with  the 

understanding  that  the  eigenpair  (XQ,uQ)  =  (0,1)  of  (10.50)  is 

not  present  in  (8.10).  Note  that  lluil  =  l|ulla.  Let  II  ull  - 
1/2 

(b(u,u) )  .  The  assumptions  made  in  Section  9  are  clearly 

satisfied.  Our  approximation  is  defined  by  (9.4)  with 

S,  .  =  S„  .  =  S.  =  (ueH  :  u  linear  on 
1 ,  n  2  ,  n  n 

(-n+jh,  -n  +  (j  +  l)h),  j  =  0,1 . n-1), 

where  h  =  2^/n  and  n  is  an  even  integer. 


Now  this  choice  for  (S^)  it  is  easily  seen  that 

(10.51)  inf  !i  cos  <p  (x)  -  .f !!  2  -  Ch2 

*eSh 

and 

O  1  X/v 

(10.52)  inf  ||sin  <p  (x)  -  x\\  -  Ch 

*eSh 

Hence  from  Theorem  10.1  we  would  expect  X 1  h  and  X^  h,  the  two 

approximate  eigenvalues  that  converge  to  the  double  eigenvalue 

=  X to  have  different  convergence  rates. 

From  Tables  10.1  and  10.2  we  can  find  the  errors  in  X,  . ,  i 

h,  i 

=  1,2, 3, 4.  for  a  =  .4.  These  errors  are  plotted  in  Figure  10.8 

in  log-log  scale.  We  clearly  see  the  different  rates  of  conver- 

2  1+ct  1  4 

gence,  specifically  seeing  the  rates  h  and  h  =  h  for 

the  errors  in  X i  h,  for  i  =  1,3  and  i  *  2,4,  respectively, 
as  suggested  by  (10.51)  and  (10.52).  It  should  be  noted  that  the 
estimates  presented  in  Theorem  10.1  are  of  an  asymptotic  nature  in 
that  they  provide  information  only  for  small  h  (or  large  n) , 
i.e.,  for  h  (or  n)  in  the  asymptotic  range.  From  Figure  10.8 
we  see  that  for  a  =  .4  we  are  in  the  asymptotic  range  quite 
quickly,  say  for  n  2:  16. 

Consider  u ^  h  and  u2  the  approximate  eigenfunctions 

corresponding  to  x ^  h  and  ^2  h'  respectively,  normalized  by 

II*  Up  =  1.  The  results  of  Section  9  suggest  that  u ^  h  should  be 

close  to  C  cos  <p(x)  and  u  .  close  to  C  sin  ®(x)  (cf.  (10.51) 

&  r  n 

and  (10.52)),  where  C  is  such  that  C  sin  <p(x )  and  C  cos  <p(x) 

-1/2 

are  normalized  by  II  •  II D  =  1,  i.e.,  c  -  n  .  To  illustrate 

this  point  we  have  computed  cj*^  and  '  1  =  1 »  2 , 3 , 4  ,  so 


i 


1 


that 


K(i) 


Hu.  .-cj^  cos  <p(x)  -  C*1*  sin  v>{x)||  ,  i  =1,2 

1 ,  n  l  i  a 

It  u  -  .-C^  cos  2  <p(x)  -  cj.^^  sin  2</>(x)||  ,  i=  3,4 

i ,  h  1  2  a 


,v 

.V 

:■> 


£ 


V. 


» 


a 


is  minimal.  We  would  expect  that 


( 10.53) 


0 


and 

(10.54)  C{11)  =  C<2)  =  C*3)  =  C<4)  a-  C  =  .564189583... 


Table  10.2  shows  some  of  the  results  for  a  =  .4.  We  see  clearly 
the  results  predicted  in  (10.53)  and  (10.54).  Table  10.2  also 
shows  that  K(l)  <  K(2)  and  K(3)  <  K{4),  as  we  would  expect. 

The  last  columns  in  Table  10.2  and  Figure  10.8  show  that  the 
ratios 

*i+l,h  -  Xi+1  4  _  ,  0 
\ .  .  -  A  .  '  1  ” 


increase  as  h - >0 .  This  shows  that  in  the  whole  h-range  we  con¬ 

sidered,  the  approximate  eigenvalues  converging  to  a  multiple 
eigenvalue  are  well  separated. 


3 

i. 


156 
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Table  10.2 


Numerical  Solution  of  the  Eigenvalue  Problem  (10.50)  for  a  =  .4 


n 

i 

h.h 

K(i) 

SHHEHEff 

1 

1 

1.0716754 

.5637791 

0 

-  .1124891 

-16 

1 . 5562955 

8 

2 

1 . 1115481 

-.4151973 

-13 

. 5636998 

0 

3 

5 . 0394692 

m 

.5558919 

0 

. 1317809 

-12 

1 . 1943249 

4 

5 . 2414639 

9 

. 5022638 

-13 

. 5516234 

0 

1 

1.0175850 

.  1329 

E 

.  5641633 

0 

. 1596754 

-12 

2.0041570 

16 

2 

1 . 0352431 

.  1881 

E 

- . 8916589 

-12 

.5641519 

0 

3 

4 . 2691915 

.  5259 

0 

. 5636643 

0 

.1124328 

-13 

1 . 2575063 

4 

4 . 3385100 

.  5869 

0 

- . 2689727 

-12 

.5637697 

0 

n 

1.0043740 

.  6618 

B 

.  5641879 

0 

. 6411454 

-11 

!  '  1 

2 . 6003887 

32 

H 

1.0113741 

.  1067 

m 

.  1323421 

-10 

.  5641830 

0 

H 

4 . 0666055 

.  2589 

0 

.  5641561 

0 

.1970954 

-10 

1 . 4067517 

Q 

4.0936974 

.  3067 

0 

-.7375504 

-10 

. 5641613 

0 

1 

1.0010921 

.  3305 

-1 

.5641895 

0 

. 7729760 

-9 

3 .5190001 

64 

2 

1.0038431 

.6202 

-1 

.8670648 

-9 

.  5641883 

0 

3 

4.0166006 

.  1289 

0 

. 5641875 

0 

. 3641341 

-10 

1 . 6437659 

4 

4 . 0272875 

.  1653 

0 

.  1415775 

-8 

. 5641858 

0 

1 

1 .0002729 

.1651 

-1 

. 5641895 

0 

. 4535626 

-7 

4 .9215830 

128 

2 

1 . 0013431 

.  3665 

-1 

.3251219 

-7 

. 5641893 

0 

3 

4 . 0041468 

.  6440 

-1 

.  5641895 

0 

.4409247 

-7 

2 .0107071 

4 

4.0083380 

.9135 

-1 

-.9705611 

-8 

. 5641890 

0 

1 

1 . 0000682 

.8255 

-2 

. 5641896 

0 

. 8070959 

-5 

7 . 0542522 

256 

2 

1.0004811 

.  2193 

-1 

.7269570 

-6 

. 5641895 

0 

3 

4.0010365 

.  3217 

-1 

. 5641896 

0 

. 6435344 

-6 

2 . 5706705 

4 

4 . 0026645 

.5162 

-1 

-  .  2601000 

-6 

.  5641895 

0 

Consider  next  the  case  when  n  =  .01.  Table  10.3  presents 
the  same  results  Col-  a  .01  as  Table  10.2  does  for  a  =  .4. 
Figure  10.9  shows  the  graph  of 


log 


Xi+l,h  -  Xi  +  1 
Xi,h  "  Xi 


1,3. 


as  a  function  of  the  number  of  intervals  n  in  a  semi-logarithmic 

scale.  The  computed  values  are  indicated  by  0's  and  x's.  The 

graphs  are  formed  by  interpolation  (solid  lines)  and  extrapolation 

(dotted  lines).  We  note  three  related  phenomena  that  did  not  occur 

with  a  -  .4.  For  small  n  the  approximate  eigenfunction  associ- 

-1/2 

ated  with  X .  .  is  u,  .  *  n  sin  *(x) ,  in  contrast  to  u,  ,  * 

1 1  n  1 1  n  l  f  n 

-1/2  -1/2 
n  cos  <p{x)  when  a  =  .4.  We  remark  that  »  cos  />(x)  is 

-1/2 

more  easily  approximated  by  than  is  n  sin  <p(x)  for  all 

0  <  a  <  1.  This  anomaly  i«j  present  for  n  *  64  but  for  n  ^  128 

we  get  results  which  are  in  agreement  with  the  (asymptotic)  results 

in  Section  9.  For  X  -  .  and  X.  .  we  have  to  take  n  >  256  to 

3  ,  h  4  ,  h 

get  results  which  agree  with  the  asymptotic  theory. 

For  a  =  .01  we  see  that  K{2)  <  K(l)  for  small  n(n  <  64) 

and  K(2)  >  K(l)  for  large  n  and  K(4)  <  K(3)  for  small 

n(n<128)  and  K(4)  >  K(3)  for  large  n.  Recall  that  K(2)  > 

K ( 1 )  and  K(4)  >  K(3)  for  all  n  when  a  =  .4. 

Finally  we  note  that  when  a  =  .01  the  ratio 

Xi+l,h  “  X i+1  4  _  ,  „ 

X.  .  -  X.  '  1  ~ 

i ,  h  a 

first  decreases  as  n  increases,  then  for  some  n  the  two  eigen¬ 
value  errors  become  equal,  and  then  the  ratio  increases  again. 


This  is  in  contrast  to  the  case  for  o. 


.4, 


in  which  the  ratio 


’■-T?."  v.’wo\lvr* 


increased  over  the  whole  range  of  n  values.  We  further  note 


that  the  value  n  for  which  the  eigenvalue  errors  are  equal 


n  *  70  for  i  =  1  and  n  -  160  for  i  =  2  (see  Figure  10.9)  — 


marks  a  transition  in  each  of  these  situations  from  u„  .  =»  n 

1 ,  h 


-1/2 


-1/2  -1/2 

sin  <p(x)  to  ui  h  ~  71  cos  and  u3  h  ”  n  sin  2^(x) 


to  u3  h  *  n  1/2  cos  2  <p(x),  from  K(2)  <  K(l)  and  K  ( 4 )  <  K(3) 


to  K(2)  >  K(l)  and  K{4)  >  K(3),  and  from 


X i  +  1 , h  Xi+1 


X  .  ,  -  X  . 

l ,  h  l 


,  i  = 


1,3,  decreasing  to  increasing. 


We  have  thus  seen  that  for  a  =  .4  the  numerical  results  are 


in  concert  with  the  (asymptotic)  results  in  Section  9  for  the  whole 


range  of  n  considered,  while  for  a  »  .01  they  are  in  disagree¬ 


ment  for  small  n,  but  are  in  agreement  for  large  n.  We  now  make 


an  observation  that  further  illuminates  these  two  phases  of  error 


behavior  —  the  pre~asymptotic  and  the  asymptotic.  Toward  this  end 


we  note  that  if  (^i»u1),  with  ||u1(!b  =  1,  and  (X1  ^Uj  h)  , 


with  ||Uj  hHb  =  1,  are  first  eigenpairs  of  (9.1)  and  (9.4),  res¬ 


pectively,  then 


0  <  X 


-  X  =  llu.  ,-u  ||_  -  X.llu.  .  — u 


1  l,h  lb 


(10.55) 


=  inf  [IU-UjII2  -  X  jJU  -Uj  |]  2  ] 
-t";Sh 


i*»b=1 


If  Xj  is  a  multiple  eigenvalue,  then  the  u^  in  (10.56)  can  be 


any  corresponding  eigenvector  with  ||u^|lb  =  1.  (Note  that  we  are 


here  assuming  Uj  and  Uj  b  have 


-length  equal  1,  whereas 


in  (10.2)  and  (10.5)  they  are  assumed  to  have 


-length  equal 


to  1.)  The  first  Inequality  in  (10.55)  follows  from  the  minimum 


principle  (8.35)  and  has  already  been  stated  in  (8.42).  The  first 
equality  in  (10.56)  follows  immediately  from  Lemma  9.1  with  (\,u) 


=  (\1,u1) ,  w  = 
with  IU!(b  =  1 


“i.h-  and  x  =  a<ui,h'ui.h)  =  S.h- 


If 


then  from  the  minimum  principle  (8.35), 


* £  sh 


(10.56)  *1  h  "  X1  s  »(*■*)  -  lj- 

Again  from  Lemma  9.1,  this  time  with  with  (A,u)  =  ( \ ^ , u ^ ) ,  w  =  x , 
and  A  =  a(x,x),  we  have 

(10.57)  aQ(x,X)  ~  x  1  ~  IU-UjII^  -  ^•1!U-u1!!b. 

The  second  equality  in  (10.55)  follows  from  (10.56)  and  (10.57). 

It  is  clear  from  the  above  discussion  that  u^  can  be  any  eigen¬ 
vector  corresponding  to  A  . 

From  (10.55)  we  hive 

(10.58)  \1  h-A1  <  lU-UjIlJ  -  A1IU-u1Hb,  V  ^  g  Sh  with  |Ullb  =  1. 


If  x  is  ||*||  -close  to  u,  ,  to  be  more  precise,  if  x  is  taken 
to  be  the  a-projection  of  u1  onto  (cf.  (8.17)),  then  the 

second  term  as  the  right  side  of  (10.58)  is  negligible  with  respect 
to  the  first  term.  This  follows  from  the  compactness  assumption 
made  in  Section  9.  On  the  other  hand,  if  ||u  -t||  is  not  small, 

X  a 

Aj  b  -  A1  may  still  be  small  because  of  cancellation  between  the 

two  terms  on  the  right  side  of  (10.58).  Regarding  the  case  a  = 

.01,  this  explains  why  for  h  large  (the  pre-asymptotic  phase),  we 

-1/2 

can  have  u  .  *  n  sin  <p  (x)  and  K(l)  >  K(2),  and  yet  have 

x  /  n 

*h  l'  aPProx*mate  eigenvalue  associated  with  Uj  b>  closer  to 

Aj  than  is  X^  b<  the  approximate  eigenvalue  associated  with 
-1/2 

u„  .  *  n  cos  v>(x).  while  for  h  small  (the  asymptotic  phase), 


we  have  Uj  h  -  «  COS  ^  <  and  Ai  h  closer  to 

X1  than  is  *2  h'  showin9  that  the  eigenvalue  error,  X ^  h  -  *i( 

2 

is  governed  by  inf  ItJ-u.ll  . 

_  -L  3 

*‘Sh 

The  analysis  of  example  (10.50)  we  have  presented  is  taken 
from  Babuska  and  Osborn  [1987]. 


n 

a 

Xu  . 
h,  l 

K(i) 

•H 
'  *H 

o 

c'1’ 

X.  .  -X  .  , 

h , 1 + 1  5+1 

X.  .-X. 
h,  l  i 

i 

1.0520268 

.  2338 

0 

.8181940 

-12 

.5634386 

0 

MRVQQHj 

8 

2 

1 .0529172 

.  2268 

0 

.5645965 

0 

- . 2916448 

-11 

3 

4 . 8576239 

.  9593 

0 

-.9346720 

-13 

. 5597529 

0 

1 .0164293 

4 

4.8717141 

.9615 

0 

.  5604533 

0 

.1167277 

-11 

1 

1.0128661 

G 

.  8717399 

-10 

. 5635957 

0 

1 .0111689 

16 

2 

1.0130098 

1 

G 

.5647369 

0 

-.8480131 

-9 

3 

4 . 2088367 

.4650 

0 

.2507177 

-10 

. 5636658 

0 

1 . 0087030 

4 

4.2106542 

.4577 

0 

. 5642694 

0 

- .3101833 

-10 

1 

1.0032139 

B 

-.9345818 

. 5636031 

0 

1 . 0068764 

32 

2 

1.0032360 

B 

.5647430 

.1273043 

-7 

3 

4.0515675 

.  2384 

. 3745461 

. 5638178 

0 

1 . 0057284 

4 

4.0518629 

.  2205 

H 

.5644172 

-.4115544 

-9 

1 

1.0008063 

.5369 

-.1311961 

-5 

.5636032 

0 

1.0017363 

64 

2 

1.0008077 

.3398 

II 

.5647430 

0 

.  2462939 

-7 

3 

4.0128623 

.1343 

0 

. 2743681 

-7 

. 5638240 

0 

1 . 0035997 

4 

4 .0129086 

.  9792 

-1 

. 5644235 

0 

.3196172 

-8 

1 

1 . 0002018 

.4196 

-1 

. 5647430 

0 

.  3356056 

-5 

1 . 0064420 

128 

2 

1.0002031 

.4775 

-1 

.  7414162 

-6 

. 5636032 

0 

3 

4.0032196 

.9166 

-1 

. 2379072 

-6 

. 5638239 

o 

1 . 0010560 

4 

4 . 0032230 

.9745 

-2 

. 5644235 

0 

.1197135 

-5 

1 

1 .0000504 

.  4372 

-1 

. 5647429 

0 

. 1061527 

— 

-4 

1 .0218254 

256 

2 

1.0000515 

.  4614 

-1 

-.  1553659 

-4 

. 5636031 

0 

3 

4.0008054 

.  5011 

-1 

. 5644234 

0 

-.2123278 

-4 

1 . 0031040 

4.0008079 


7741  -1 


5638238 


1165012  -5 


NUMBER  OF  INTERVALS 


tors . 


We  consider  here  the  approximation  of  the  eigenpairs  of  gene¬ 
ral  second  order  elliptic  operators.  This  problem  is,  in  large 
part,  similar  to  those  discussed  in  Subsections  A.  and  B.  above; 
we  will  thus  be  brief,  discussing  in  detail  only  those  issues  that 
have  a  treatment  in  this  case  that  differs  from  that  for  the  L- 
shaped  panel  or  the  membrane,  or  those  issues  that  did  not  arise 
with  those  problems. 

Consider  the  eigenvalue  problem 


( 10.59) 


(Lu)(x)  =  MMu){x),  x  e  ft 
( Bu) ( x)  =0,  xeT  =80, 


where  0  is  a  polygonal  domain  in  R  ,  L  is  given  in  (3.2),  M 
in  (3.3),  and  B  in  (3.4),  L  is  assumed  to  be  uniformly  strong¬ 


ly  elliptic  (cf.  (3.5)),  a..,b.,c,  and  d  to  be  bounded  and 

^  J  1 


measurable,  and  d  to  be  bounded  below  by  a  positive  constant 
( cf .  Section  2 ) . 


In  Section  2  we  saw  that  (10.59)  has  the  variational  form 


(8.10),  ( cf . (3.18)  and  (3.20)),  with  H  =  H2 


Hq(Q)  in  the  case 


of  Dirichlet  boundary  conditions  and  Hj  =  H2  =  Hq(Q)  in  the  case 


of  Neumann  conditions,  and  a  and  b  given  in  (3.14).  (8.1)  - 

(8.3)  hold,  (8.2)  and  (8.3)  being  a  consequence  of  (3.5),  provided 


(10.60) 


p  ,  i  a0  b2 

Re  c(x,  ^  + 


where  b  =  max  'b.(x)!  (cf.  3.17).  We  remark  that  (10.60)  can 
xtQ  1 
i  =  l  ,  2 

be  easily  achieved.  It  does  not  hold  for  the  given  operator  L,  L 
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can  be  modified,  by  adding  an  appropriate  multiple  of  d(x)  to 

c(x) ,  so  that  it  does  hold.  This  change  shifts  the  eigenvalues 

and  leaves  the  eigenfunctions  unchanged.  We  also  see  that  (8.7) 

is  satisfies  with  =  W2  =  L^(0).  Thus  (10.59)  has  the  form  of 

the  problem  analyzed  in  Section  8. 

We  remark  that  in  this  subsection,  since  we  are  not  imposing 

any  selfadjointness  assumptions,  the  spaces  H  and  ^  ^  = 

S  must  be  taken  to  be  complex  and  the  eigenvalue  parameter 

^  >  ( n ,  p ) 

X  must  be  considered  complex. 

As  we  have  seen,  the  selection  of  the  trial  and  test  spaces 
S  .  and  S  ,  .  is  guided  by  the  regularity  properties  of 

the  exact  eigenfunctions  and  adjoint  eigenfunctions.  In  general, 
determining  this  regularity  and  then  using  it  to  choose  effective 
trial  and  test  spaces  is  a  delicate  task.  The  regularity  can 
depend  on  the  coefficients  in  the  differential  equation,  e.g.,  on 
where  they  have  discontinuities  and  where  they  are  smooth,  and  on 
the  domain,  as  we  have  seen  with  the  L-shaped  panel.  We  will  not 
go  further  in  this  direction,  but  will  instead  assume  the  eigen- 

kl 

functions  belong  to  H  (Q  )  and  the  adjoint  eigenfunctions  to 
k2 

H  (Q),  and  select  trial  and  test  spaces  so  as  to  reflect  this 
assumption. 

Remark  10.5.  For  eigenvalue  problems  with  rough  coefficients, 
which  arise  in  the  analysis  of  vibrations  in  structures  with  rapid¬ 
ly  changing  material  properties  (such  as  composite  materials)  it  is 
known  that  the  eigenfunctions  do  not  lie  in  any  high  order  Sobolev 
space.  Nevertheless,  for  one  dimensional  problems,  their  regular¬ 
ity  can  be  understood  and,  based  on  this  understanding,  one  can 
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I 

I 

I 

select  trial  and  test  spaces  that  lead  to  very  accurate  and  robust 
approximations.  These  trial  and  test  spaces  are  not  of  the  usual 
polynomial  type,  but  instead  closely  reflect  the  coefficients. 

For  details  see  Babuska  and  Osborn  [1983,  1985].  Cf .  also  Subsec¬ 
tion  11. C. 

Remark  10.6.  The  mathematical  study  of  the  use  of  regularity 
information  for  the  optimal  selection  of  trial  and  test  functions 
belongs  to  the  area  of  complexity  and  information  based  approaches 
See,  e.g.,  Wozniakowski  [1985]. 

Based  on  the  information  that  the  eigenfunctions  lie  in 

V  Ir 

1  2 
H  (0)  and  the  adjoint  eigenfunctions  in  H  (0),  with  klfk2  ' 

1,  it  is  appropriate  to  discretize  (8.10)  by  choosing 

Sq(t),  for  Dirichlet  conditions 
SP(t),  for  Neumann  conditions, 

as  in  Subsection  B.,  where  t  c  7  and  7  =  (T}  =  is  a  fami¬ 

ly  of  a-regular  triangulations  of  Q.  (8.11),  with  /3(h)  =  aQ/2, 
and  (8.12)  follow  from  (3.5).  (8.13)  follows  from  (10.40). 

(8.14)  (or  (8.15))  can  now  be  considered  and  from  it  we  get 

eigenpairs  (A,.  . ,u,.  .)  which  serve  as  approximations  to  the 

(h,p)  (h,p) 

eigenpairs  (X,u)  of  (10.59)  (or  (8.10)).  The  errors  in  the 
approximate  eigenpairs  can  be  estimated  with  the  results  of  Sec¬ 
tion  8. 

Let  \  be  an  eigenvalue  of  (10.59)  (or  (8.10))  with  alge¬ 
braic  multiplicity  m  (by  which  we  mean  that  A  1  is  an  eigen¬ 
value  with  algebraic  multiplicity  m  of  the  compact  operator  T 


^  S(h,p)  Sl, (h,p)  S2, (h,p)  =  I 


introduced  in  (8.8)).  Recall  that 


M(X)  =  the  unit  ball  (with  respect  to  H  (0 )  )  in  the  space 


of  generalized  eigenfunctions  associated  with  X , 

and 

* 

M  (X)  =  the  unit  ball  in  the  space  of  generalized  adjoint 
eigenvectors  associated  with  X . 


From  (10.40b)  in  the  case  of  the  Dirichlet  problem  and  (10.40a)  in 
the  case  of  the  Neumann  problem  we  have 


£ ( h, p) 


(X) 


sup  inf  ||  u  -  *1! 

ueM(X)  ireS. 

xl  r  P 


* 


h"1'1 

C— I r— r  sup 
1  ueM(X) 


u| 


krQ' 


where  n .  = 


min(p+l ,k1 ) ,  and 


£  <h.p)(M  =  sip 

veM  (X)  r?eS 


inf  II v  -  )?| 


1  ,o 


"2"1 


h,  p 


5  C- 


T^T  s?p  !iullk,.Q' 


veM  (1  ) 


where  y 2  =  min(p+l,k2). 

Let  X  ( h, p) , . . . , X  ( h , p)  be  the  eigenvalues  of  (8.14)  that 
converges  to  X ,  let 


M. 

h ,  p 


(u  :  u  in  the  direct  sum  of  the  generalized  eigen- 
spaces  of  (8.14)  corresponding  to  the  eigen¬ 
values  X  j  (  h ,  p ) . *  m  ( h  '  P )  '  liU,'i  o  = 


and  let  a  =  ascent  of  (X  1  -  T) . 

Applying  Theorem  8.2  we  have 
m 

(10.61)  |>  -  (i  y»H(h,p)-')-1|  •  Ct  . .  „,mc*  „.<M 


■ 


MX 


' k  +k  -2  11  U|1  k  M  V!,k  0  • 

p  1  2  u.rM(»)  lVM  V9 


In  light  of  Remark  8.1  we  also  have 

m 


U 


j  =  1 


(10.62) 


l/V'V2 

s  C~kTk~  SUp  |u|!k  M  SUp  l|vl,k  Q- 

p  1  2  ueM(X)  kl'MveM*(,)  k2'° 


From  Theorem  8.3  we  obtain 


(10.63)  \X-X (h,p)|  s  C 


H^2-2 


sup  Hull  sup  II  vi! 

p  1  2  U£MU)  1  veM  (\)  2 


1/a 


Regarding  eigenfunction  estimates,  we  apply  Theorem  8.1  and 
8.2.  From  Theorem  8.1  we  have 


( 10.64) 


6(M(\),M  (\))  <  CiiF-r T  sup  Hull 

1  'PJ  _  1  1  ueM(l  )  1  'U 


"l"1 


Let  \(h,p)  be  an  eigenvalue  of  (10.1)  (or  (8.10))  such  that 

lim  A(h,p)  =  *  and  let  w.,  .  is  a  unit  vector  satisfying 

h-*o  (h'p) 

P-*00  f 

- 1  1 

(\(h,p)  -  T)  w^  p)  =  0  for  some  positive  integer  <  a. 

Then,  from  Theorem  8.4,  for  any  integer  with  ^  <  a , 

- 1  ^2 

there  is  a  vector  u(,  ,  such  that  (\  -  T)  u,.  ,  =  0  and 

(h,p)  (h,p) 


(10.65)  iu,.  -  w,.  II  <  C 
(h,p)  (h,p)  1,0 


"l-1 


T^T  sup  "u!lk  o 

1  ueM(X)  1' 


(  W1)/a 


Remark  10.7.  In  this  section  we  have  considered  triangular  meshes 


One  could  also  consider  quadrilateral  meshes,  which  are  a  general¬ 
ization  of  the  type  of  mesh  employed  in  Subsection  A.,  or  curvili- 
linear  meshes.  Since  these  generalizations  properly  belong  to 
approximation  theory  we  will  not  pursue  them.  We  refer  the  reader 
to  Ciarlet  [1978],  Babuska  and  Guo  [1978b],  and  Szabo  [1986]. 

Remark  10,8.  We  have  mentioned  here  only  estimates  based  on  the 

k  2 

information  that  u  e  H  (0).  If  we  know,  e.g.,  that  u  e  H ^  (Q  )  , 

then  we  can  say  more,  provided  a  proper  mesh  is  selected. 

Remark  10.9.  The  approximate  eigenvalues  \^(h,p)  here,  as  in 
any  finite  element  method,  are  defined  by  the  eigenvalue  problem 
(8.14),  which  involves  integrals  over  the  domain  0 .  In  practice 
these  integrals  often  must  be  evaluated  (approximated)  by  quadra¬ 
ture  formulas.  For  estimates  of  the  eigenvalue  error  due  to  this 
quadrature  error  we  refer  to  Fix  [1972].  We  note  that  the  use  of 
a  finite  element  method  in  conjunction  with  a  quadrature  method 
often  leads  to  a  finite  different  method  for  eigenvalue  approxima¬ 
tion.  For  example,  if  we  approximate  the  eigenvalues  of 

{-An  =  \u,  on  Q 
u  =  0 ,  on  T 

with  the  finite  element  method  corresponding  to  p  =  1  and  a  uni¬ 
form  triangulation  (cf.  Figure  10.8)  and  evaluate  the  resulting 
Integrals  with  an  appropriate  quadrature  formula,  we  obtain  the 
standard  5-point  difference  eigenvalue  approximation  for  the 
Laplacian  (cf.  Section  5).  This  observation  is  due  to  Courant 
[1927,  1943].  For  further  results  on  finite  difference  methods  we 
refer  to  Polya  [1952],  Hersch  [1955,  1963],  Weinberger  [1956,  1958, 

1974],  Hubbard  [1961,  1962],  Kuttler  [1970a, b],  and  Kreiss  [1972], 


Remark  10 . 10 .  Since  the  eigenvalue  *  (h,p)  are  defined  by  a 

Ritz  method,  they  are  upper  bounds  for  the  exact  eigenvalues  A 


x3  5  Vh'p> 


(cf.  (8.42)).  If  we  could  derive  a  lower  bound  \j(h,p),  then 
one  would  have  bracketed  \j.  Much  attention  has  been  directed  to 
the  derivation  of  lower  bounds.  Weinstein  [1935,  1937,  1953,  1963] 
developed  the  method  of  intermediate  problems.  Many  authors  have 
contributed  to  the  development  of  this  and  other  related  varia¬ 
tional  methods.  We  mention  D.H.  Weinstein  [1934],  Aronszajn  and 
Weinstein  [1942],  Aronszajn  [1948,  1949-50],  Weinberger  [1952a, 
1952b,  1956,  1959,  1960],  Bazley  [1959],  Bazley  and  Fox  [1961a, 
1961b,  1963].  In  addition  we  mention  the  monographs  by  Collatz 
[1948],  Weinstein  &  Stenger  [1972],  and  Weinberger  [1974]. 

Remark  10.11.  Most  books  and  monographs  that  treat  finite  element 
methods  contain  a  section  or  chapter  on  eigenvalue  problems.  For 
a  survey  of  books  and  monographs  on  finite  element  methods  we  refer 
to  Noo*  [1985].  Of  the  more  mathematically  oriented  of  these,  we 
mention  Strang  and  Fix  [1973],  Oden  and  Reddy  [1976],  and  Oden  and 
Carey  [ 1982] . 


Section  11.  Approximation  by  Mixed  Methods 

In  Section  3  we  saw,  in  terms  of  an  example,  how  eigenvalue 
problems  can  be  given  mixed  formulations.  Mixed  formulations  can 
be  discretized  and  thereby  lead  to  approximation  methods  referred 
to  as  mixed  finite  element  methods.  In  this  section  we  discuss 
three  such  methods.  We  begin  by  presenting  an  abstract  result 
designed  for  the  analysis  of  mixed  methods. 

Remark  11.1.  Mixed  methods  for  source  problems  have  received  con 
siderable  attention.  We  mention  Hermann  [1967],  Glowinski  [1973] 
Johnson  [1973],  Oden  [1973],  Brezzi  [1974],  Ciarlet  and  Raviart 
[1974],  Mercier  [1974],  Scholz  [1976],  Raviart  and  Thomas  [1977], 
Brezzi  and  Raviart  [1978],  Falk  [1978],  Babuska,  Osborn,  and 
Pitkaranta  [1980],  and  Falk  and  Osborn  [1980]. 

A .  An  Abstract  Result 

Let  V , W , H  and  G  be  four  real  Hilbert  spaces  with  inner 
products  and  norms  II  *  II  v ,  (-,*)w,  II  •  li  M .  (*,’)H,  I!  *  li  H  '  an 

II  •  II G,  respectively.  We  assume  V  c  H  and  W  c  G.  Let 
A (a,<p)  and  B(#>,u)  be  bilinear  forms  on  H*H  and  VxW,  respec 
tively,  that  satisfy 

(11.  la)  |  A(a  ,ip)  |  s  c1ll<F|lHil  vll  H,  V  a  ,tp  e  H 

i  id 

(11-lb)  |  B  ( ^  ,  u)  |  <  C2ll*|lv!l't!lw.  V  ip  e  V,  u  e  W. 

We  assume  A{<7  ,ip)  is  symmetric  and  satisfies 
(11.2a)  A(a,a)  >  0,  V  0  *  o  H, 


and  assume 


ArVV 


(11 . 2b) 


sup  ! B ( » , u ) !  >0,  Vo*  u  e  W. 

*>eV 


We  then  consider  the  following  eigenvalue  problem: 


(11.3) 


(cr,u)  e  VxW,  (a,u)  +  (0,0) 

k{a,y>)  +  B{y>,u)  =  0 ,  V  y>  e  v 


B(cr,v)  =  -A.(u,v)Q,  V  v  e  W 


A  discretization  of  (11.3)  is  obtained  by  selecting  finite  dimen¬ 
sional  spaces  c  V  and  W^  c  w  and  considering  the  approximate 

eigenvalue  problem 


(11.4) 


(ah' uh^  €  VhxWh'  ^CTh'uh)  0  *0,0) 
k(ah,ip)  +  B(v>,uh)  -  0,  V  y>  e  Vh 

B(ah,v)  =  -‘h(uh,v)G,  V  v  c  Wh. 


We  then  view  ( , (o^ , uh ))  as  an  approximation  to  (\,(o,u)). 
Given  bases  for  and  W^,  (11.4)  becomes  a  matrix  eigenvalue 

problem . 

Remark  11.2.  If  we  let 

a(  (a  ,u)  ,  ( ip  ,v)  )  =  A  [a,  id)  +  B(p,u)  +  B(a,v), 
b  (  ( ,  u )  ,  (  v  <  v )  )  =  -(u,v)Q, 


H  =  V*W, 


then  (11.3)  can  be  written  as 


(11.5) 


(ct,u)  e  H  ,  (a  ,  u )  *  (0,0) 

a(  (a  ,u)  ,  (i/> ,  v)  )  =  »b(  (<r,u)  ,  («>,v)  )  ,  V  (  w  ,  v )  c  H, 


which  has  the  form  of  (8.10).  Also  (11.4)  has  the  form  of  (8.14 


with  S.  .  =  S_  .  =  V,  xW.  .  (11.3)  and  (11.5)  do  not,  however, 

l  f  n  2  ,  n  n  ii 

satisfy  all  of  the  hypotheses  of  the  results  in  Section  8.  We 
thus  need  an  alternative  analysis.  This  will  be  provided  by  Theo¬ 
rem  11.1,  which  is  based  on  the  results  of  Section  7.  Note  that 
even  though  the  methods  considered  in  this  and  the  next  section 
are  not  covered  by  the  results  of  Section  8,  it  is  still  useful  to 
discuss  them,  to  the  extent  possible,  in  terms  of  the  basic  step 
1),  1'),  2'),  and  3)  introduced  in  Section  10. 

In  order  to  estimate  the  error  in  the  approximate  eigenpairs 
(X.  ,  ^ah'Uh^  We  cons*^er  associated  source  and  approximate 

source  problems: 


(11.6) 


'Given  g  e  G,  find  (a, u)  e  V*W  satisfying 
>A (a, ip)  +  B(tl>,\l)  =  0,  V  ip  e  V 
.B(o,v )  =  -  (g  ,  V )  Q  ,  V  v  e  W; 


(11.7) 


Given  g  e  G,  find  (ah'uh^  e  Vh*Wh  satisfying 
A(crh,i p)  +  B(//>,uh)  =  0,  V  ip  e  Vh 
[B(a  v)  =  -(g,v)  ,  V  v  e  W  . 


We  assume  (11.6)  and  (11.7)  are  uniquely  solvable  for  each  g  e  G 
We  then  introduce  the  corresopnding  component  solution  operators: 


(  11.8a) 


(  11.8b) 


(11.8c) 


(  11 .8d) 


lv  ■  V 


where  (<7,u)  and  <ah'uh^  are  de^ined  bY  (11.6)  and  (11.7),  res¬ 
pectively.  (Note  that  the  T  introduced  here  is  different  from 
that  introduced  in  (8.8).) 

The  eigenpairs  (\,(cr,u))  of  (11.3)  can  be  characterized  in 
terms  of  the  operator  T.  Before  establishing  this  we  note  that 
A.  =  hS° ,a  2 ,  which  shows  that  X  >  0.  This  follows  from  (11.3) 

IU,ujg 

and  the  observation  that  both  components  u  and  a  of  an  eigen¬ 
vector  are  nonzero.  Now,  if  (A.,(cr,u))  is  an  eigenpair  of 

(11.3) ,  then  ITu  =  u,  u  *  0,  and  if  \Tu  =  u,  u  *  0,  then  there 

is  a  a  e  V  ( (7  =  S  ( A.u )  )  such  that  (X,(a,  u)  )  is  an  eigenpair  of 

(11.3) .  Thus  X  is  an  eigenvalue  of  (11.3)  if  and  only  if  \  1 

is  an  eigenvalue  of  T.  The  correspondence  between  eigenvectors 
is  given  by  u< — »(a,u).  In  a  similar  way  the  approximate  eigen¬ 
values  defined  by  (11.4)  can  be  characterized  in  terms  of  the 
eigenvalues  of  T^.  is  an  eigenvalue  of  (11.4)  if  and  only  if 

is  an  eigenvalue  of  T^;  the  correspondence  between  the 

eigenpairs  is  given  by  uh< - »((?h,uh)  . 

We  assume 


(11.9) 


,T  ~  VgG-*0  as  h— >0' 


where,  for  an  operator  A  :  D(A)(cX) — >Y,  we  let 

I!  Aw|!  „ 


!  Ail  XY  =  sup 


weD(A)  ,IWl|X 


(In  particular,  we  assume  T  is  a  bounded  operatorn  on  G.)  Since 
dim  R(T  )  <  <»  for  each  h,  the  T,  are  compact  and  (11.9)  thus 


implies  T  is  compact.  We  also  note  that  T  is  selfadjoint  on 
G.  This  is  seen  as  follows.  Let  v  =  Tf  in  the  second  equation 
in  (11.6)  to  obtain 

B { Sg , Tf )  =  - ( g . Tf ) G . 

Again  consider  (11.6),  but  with  g  replaced  by  f,  and  let  i  = 
Sg  in  the  first  equation  to  get 

A( Sf , Sg)  +  B(Sg.Tf)  =  0. 

From  these  two  equations  we  have 

(11.10)  (g,Tf)Q  =  A( Sf , Sg) ,  V  f , g  e  G. 

Using  (11.10)  and  the  symmetry  of  A  we  get 

( Tg , f ) G  =  (f,Tg)Q  =  A(Sg,Sf)  =  A(Sf,Sg)  =  (g,Tf)G< 

showing  T  is  selfajoint.  In  a  similar  way  we  see  Th  is  self- 
adjoint  . 

We  now  apply  Theorems  7.3  and  7.4  to  the  operator  T  and 
family  of  operators  (T^)  on  *he  sPace  G.  By  virtue  of  the 
correspondence  between  the  eigenpairs  of  T  and  T^  and  those  of 
(11.3)  and  (11.4)  we  will  thereby  obtain  estimates  for  the  errors 
in  (*h'^ah'uh^‘  The  hYPotheses  have  all  been  shown  to  be  satis¬ 
fied;  cf.  Remarks  7.5  and  7.6.  Let  A  1  be  an  eigenvalue  of  mul¬ 
tiplicity  m.  Since  I!  T  -  GG — >0  we  know  that  m  eigenvalues 

A.(h)  * . A.  (h)  1  of  T.  converge  to  A.  1  .  Since  T  and  T. 

l  m  n  n 

are  selfadjoint  the  relevant  ascents  are  one  and  all  eigenvalues 
have  equal  geometric  and  algebraic  multiplicities.  Let  M(\  1 ) 
be  the  eigenspace  of  T  corresponding  to  A  1 .  Recall  that  M  = 


M(\  )  =  R(E),  the  range  of  the  spectral  projection  E  associ¬ 

ated  with  T  and  1  * .  We  have  denoted  this  space  by  M  to  dis¬ 
tinguish  it  from  the  set  M  of  normalized  eigenvectors  introduced 
in  Section  8. 


Theorem  11.1.  Under  the  assumptions  made  above,  there  is  a  con¬ 
stant  C  such  that 


U-Vh>|  <  C<II(S-S  )|  !!'  ♦  ll(S-Sh)|  llGvll(T-T  )|  II 

M  MM 


(11.11) 


+  11  (T-T.  )  I  ||  ;r)  ,  t  =  1 . m. 

h  Is  GG 


Proof.  Let  u, ,  . . ,u  be  an  orthonormal  basis  for  M(\  1).  From 

-  l  m 

Theorem  7.3  with  a  =  1  we  have 

m 

lX‘1-Vh>-1'  !  C<  Z  H(T-th)u1.uj)Gl  +  MT-Th)|  ll*0), 

l.J-l  M 

<11.12)  ,  _  . 

<•  —  1 ,  .  .  .  ,  m . 


For  g,f  €  G  we  estimate  ((T  -  Th)g,f)G<  Adding  the  two 
equations  in  (11.6)  and  recalling  the  definitions  of  Tg  and  Sg 
in  (11.8)  we  find 

(g,v)G  =  -A(Sg,</>)  -  B(v>,Tg)  -  B(Sg,v),  V  {ip  ,v)  e  V«W. 

Setting  v  =  (T-T^)f  and  to  =>  (S-S^)f  yields 

(11.13)  (g,(T-Th)f)Q  =  -A(Sg, (S-Sh)f )  -  B ( ( S-Sh ) f , Tg ) 

-  B ( Sg , (T-Th)f ) . 

N<*x  *■  note  that  substraction  of  the  equations  (11.7)  from  (11.6) 


w ; * h  q  replaced  by  f)  gives 


0, 


(11.14)  A( (S-Sh)f ,y)  +  B(w,(T-Th)f)  +  B((S-Sh)f,v)  = 

(y>,v)  e  VhxWh> 

Now,  combining  (11.13)  and  (11.14)  and  using  (11.1)  we  have 
I  (g.  (T-Th)f  )Q|  =  |A(  (S-Sh)f,Sg-tp)  +  B  (  (  S-Sh )  f  ,  Tg-v) 

+  B { Sg-f , (T-Th)f ) I 
s  Cj  ||  ( S-Sh)  f  !l  Hl!  Sg-v!l  H 

+  C2!l  (S-Sh)f!lv!ITg-.v!!w 
+  C2II  Sg-yll  vll  (T-Th)f  lw. 

Setting  y  =  Shg  and  v  =  T^g  gives 

I  (  (T-Th)g.f  )Q|  5  c1l|(S-Sh)f||Hll  (S-Sh)g|lH 

(11.15)  +  C21I  (S-Sh)f||vli  (T-Th)g!iw 

+  C2;i  (S-Sh)g|lvll  (T-Th)fllw. 

Letting  g  =  Uj  and  f  =  u^  in  (11.15)  yields 

(11.16)  l((T-Th)u..uj)Gl  S  C1«(S-Sh)|MH*H 

+  2C2II(S-Sh>l  S"GVI!|T-Vl  s>'GW 

(11.11)  follows  immediately  from  (11.12)  and  (11.16). 

Theorem  11.2.  Under  the  assumptions  made  above,  there  is  a  con¬ 
stant  C  such  that 

u-Vo  -  c  <T-Vi„"GG- 


Proof .  This  result  is  an  immediate  consequence  of  Theorems  7.1 


and  7.4.  Note  that  we  have  given  this  estimate  the  simplified 


form  it  has  when  \  is  simple,  and  it  would  have  to  be  modified 
in  the  general  case.  Cf.  the  statement  of  Theorem  10.1  and  (8.44) 
-  (8.46) .  a 

Theorems  11.1  and  11.2  were  proved  by  Osborn  [1979]  and  by 
Mercier,  Osborn,  Rappaz ,  and  Raviart  [1981]. 


B.  A  Mixed  Method  for  the  Vibrating  Membrane 

We  consider,  as  in  Subsection  10. B.,  the  vibrating  mem¬ 
brane  problem 


(11.18) 


-Au  =  Xu,  in  Q 
u  =  0 ,  on  T  =  50 , 


2 

where  0  is  a  convex  polygon  in  R  ,  but  we  will  here  give  it  a 
mixed  variational  formulation.  Otherwise  we  will  proceed  in  a 
parallel  way,  discussing  in  turn  the  steps  1),  1'),  2),  and  3) 
introducted  in  Subsection  10. A.  We  will  clearly  see  how  the  vari¬ 
ational  formulation  influences  the  entire  approximation  method. 

Before  proceeding  with  the  variational  formulation,  we  intro¬ 
duce  an  additional  function  space.  Let 


H(div,Q)  *  {o  =  (o^,o 2)  :  °i'°2  e  H  (0)  and  there  exists 


z  *  div  a  e  H°(0)  such  that  f  dxdy  = 

'  J0 

-I 


z<7>  dxdy ,  V  ^  c  C Q  ( 0  )  }  , 


|l(7j| 


H  ( d  i  v ,  0  ) 


2  2  2 
[ fT ^  +  a 2  +  (div  a)  ]dxdy. 


0 


1)  Variational  Formulation 


Suppose  (A,u)  is  an  eigenpair  of  (11.18),  by  which  we  will 


mean 


(11.19) 


fo  m  U  e  hJ(O) 


7  u*  7  v  dxdy  =  X 

0 


uv  dxdy,  V  v  e  H0<ft), 


ft 


i.e.,  we  will  assume  (11.18)  to  have  the  variational  formulation 
considered  in  Subsection  10. B.  We  now  derive  a  mixed  variational 
formulation  for  (11.18).  Introduce  the  auxiliary  variable 


(11.20) 

From  (11.19)  we  see  that 

(11.21) 

From  (11.21)  we  get 


a  =  Vu. 

a  e  H(div,0)  and 
div  o  =  -Xu. 


(11.22) 


v  div  a  dxdy  =  -X 


Q 


uv  dxdy ,  V  v  e  h  ( ft ) 


0 


and  from  (11.20)  and  the  definition  of  H(div,A)  we  have 


(11.23) 


a  •  y>  dxdy  = 


0 


7  u  •  ip  dxdy  =  - 


u  div  ip  dxdy. 


Q 


V  i p  e  H(div,0 )  . 


Combining  (11.22)  and  (11.23)  we  obtain 


(11.24)  ] 


(cr,u)  e  H(div,0)xH  (Q),  (</,u)  (0,0) 


a  •  i(>  dxdy  + 


0 


u  div  ip  dxdy  =  0,  V  ip  e  H(div,0) 


0 


v  div  o  dxdy  =  -X 


0 


uv  dxdy ,  V  v  e  h  (ft ) 


Now  suppose  (*,('T,u))  satisfies  (11.24).  Let  u  be  the 


solution  to 


(11.25) 
and  let 


fAu  =  Au  in  ft 
[u  =  0  in  r 


a  =  Vu. 


Then,  by  the  argument  used  above, 


(11.26)  \ 


( (J  ,  u )  e  H(div,ft)xH  (ft) 


a  •  ip  dxdy  + 


ft 


u  div  ip  dxdy  =  0,  V  ip  e  H(div,fl) 


ft 


v  div  a  dxdy  =  -\ 


0 


uv  dxdy,  V  v  e  H  (ft) 


ft 


Subtraction  of  the  equations  in  (11.26)  from  those  in  (11.24) 
yields 


(11.27) 


(a-a,  u-u)  e  H(div ,ft ) x H° (ft ) 

{(j-o)'tp  dxdy  +  (u-u)div  v>  dxdy  =  0,  V  ip  e  H(div,ft) 
Jft  Jfl 

div(CT-<7)v  dxdy  =  0,  V  v  e  H°(ft)  . 

Jft 


In  (11.27),  if  in  the  second  equation  we  take  v  arbitrary  in 

H°(ft)  we  get  div(o-a)  =  0,  and  if  we  then  take  ip  =  a-a  in  the 

first  equation  we  obtain  0  =  j  [a-a )• [a-a ) dxdy ,  which  implies 

Jft 

(11.28)  a  =  a  . 


Then  the  first  equation  in  (11.27)  implies 


(11.29) 


(u-u)  div  ip  dxdy  =  0,  V  ip  e  H(div,ft). 


Let  w  satisfy  Aw  =  u-u  and  let  ip  =  7w  in  (11.29).  Since 


div  ip  =  u-u,  this  choice  leads  to 


(11.30) 


u  =  u 


(11.25),  (11.28),  and  (11.30)  show  that  (\,u)  is  an  eigenpair  of 


(11.18)  (or  (11.19)),  and  that  o  =  Vu. 


In  summary,  if  (A,u)  is  an  eigenpair  of  (11.18)  and  o  = 


?u,  then  (X  ,  (u,cr )  )  satisfies  (11.24),  and  if  (X,(cr,u))  satis¬ 


fies  (11.24),  then  (X,u)  is  an  eigenpair  of  (11.18)  and  a  =  7u. 


(11.24)  is  the  desired  mixed  formulation. 


It  is  immediate  that  (11.24)  has  the  form  of  (11.3)  with 


V  =  H(div,Q ) , 


W  =  G  =  (Q )  , 


H  *  )  , 


A  ( a  ,  ip  )  =  a  •  ip  dxdy , 


B{ip,\i)  =  u  div  ip  dxdy. 


Furthermore,  A  is  symmetric  and  (ll.l)  and  (11.2)  hold.  The 


symmetry  of  A  and  (11.1)  and  (11.2a)  are  trivial.  To  prove 


(11.2b),  let  w  solve  Aw  =  u  and  set  u>  =  Vw.  Then  div  u>  =  u 


and  we  have 


sup  |  u  div  ip  dxdy  |  >  |  u  div  ip  dxdy  | 
H( div ,0  )  J0  JQ 


u  dxdy 


'  .  ’  .  • 


>  0,  for  0  -  u  €  H  (tt  )  . 


which  proves  (11.2b). 


From  the  fact  that  (11.18)  has  a  sequence  of  positive  eigen¬ 
values  and  from  the  correspondence  between  the  eigenpairs  of 
(11.18)  and  (11.24)  we  see  that  (11.24)  has  a  sequence  of  eigen¬ 


values 


o<\1<x2<  ...S'* 


and  corresponding  eigenfunctions 


(<?1/U1)  ,  (a2  '  u2  )  .  •  •  •  . 


with  a.  =  7u.  and  with  the  (\  .  ,u.)  being  the  eigenpairs  of 
J  3  3  J 

(11.18)  . 

1')  Regularity  of  the  Eigenfunctions 

If  (u,u)  is  an  eigenfunction  of  (11.24),  then  u  is  an 
eigenfunction  of  (11.18)  and  a  =  7u.  Thus  the  regularity  of 
(a,u)  can  be  inferred  from  the  regularity  of  the  eigenfunction 
of  (11.16),  which  was  discussed  in  Subsection  11. B. 

2 )  Discretlzat ion  of  (11.24)  and  Assessment  of  the  Accuracy  of 
the  Approximate  F.iqenvalues  and  Eigenfunctions 

We  will  use  a  discretization  of  the  gonc?rai  foni  or  (it. 4). 

It  thus  remains  to  select  the  subspaces  c  H(div,0)  and  •- 

H°(0).  This  will  be  done  with  an  eye  toward  ensuring  (11.9)  holds 

and  the  terms  on  the  right  side  of  (11.11)  in  Theorem  11.1  are 
small.  A  mixed  method  approximation  of  the  associated  source 
problem  (cf.  (11.6)  and  (11.7))  has  been  proposed  and  analyzed  in 


Raviart  and  Thomas  (1977].  We  will  take  their  choice  of  trial  and 


test  functions.  The  source  problem  has  also  been  analyzed  by  Falk 
and  Osborn  [1980]. 

Let  T  be  the  unit  triangle  in  the  (<,*?)- plane  whose  verti¬ 
ces  are  a 1  =  (1,0),  a =  (0,1),  and  a3  =  (0,0).  Then  with 
p  >  0  an  even  integer  and  T  associate  the  space  Q^+1^  of  all 
functions  >i>  =  2)  of  the  form 


p/2  +  l7?p/2 


v>1  =  pol  p{Z,T))  +  «0^P+1  +  +  ...  +  ap/2? 

V»2  “  P°lp  ( K  iV)  +  /J07?P+1  +  +  ...  +  fip/2^P/2r)P/2  +  1  ' 


where  polp(i ?,j?)  denotes  an  arbitrary  polynomial  of  degree  p 


and  where 


p/2 

]T  (-l)1(ai-/31)  =  0, 

1=0 


and  with  p  >  1  an  odd  integer  and  T  associate  the  space 

q(p+d 


of  all 

v  of  the  form 

(K  ,V) 

+  a_^+1  +  a^t|  + 

.  .  .  +  a  .  iw  ,<P+D/2  (p+l)/2 

P 

0  1 

(p+l)/2 

p(^r,) 

+  /?0»?P+1  +  #1?'7P  + 

+  3  ?(P+D/2  (P+D/2 

•  •  •  +  S  (  p+ 1  )  /  2C  " 

where 


(P+D/2 


(P+D/2 


Z  (-i)iai  =  z 


(-1)  /Ji  =  0 


i=0 


i=0 


We  remark  that  for  u>  e  Q^+1^,  ip  and  «>2  are  polynomials  of 
degree  p+1 .  With  a  general  triangle  T  in  the  (x,y)-plane,  we 
associate  the  space  Q^+1^  defined  by 
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JLh  ILARJk&JkfjL  k  /  *^k  .V  »_«  •->.  'a  *^i  Ajt  -*  ■ — ■  -  — .. 


v. 


Qt  =  (V  :  »/>(x,y)  =  -Bt?(Ft  (x,y)),  ip  e  Q  }, 

where  F,j,(^,r?)  =  +  b^  is  the  linear  transformation  map¬ 

ping  T  onto  T  and  JT  =  det(BT). 

Let  y  -  {r}  =  {t^}  be  a  family  of  a-regular  triangular iza- 
tion  of  Q.  Then  for  p  >  0  an  integer  let 

(11.31a)  Vh  =  (v  <:  H  ( div  ,0  )  :  >p\y  Q^,P+1),  V  T  e  rh) 

and 

(11. 31b)  Wh  =  (u  e  H°(0)  :  u|T  polynomial  of  degree  p,  V  T  e  r  ) 

Now  we  consider  (11.4)  with  this  choice  for  V,  and  W.  . 

h  h 

(11.4)  will  have  eigenvalues 

Xl,h  s  - - $  , h 


and  corresponding  eigenfunctions 


where  N  =  dim  (V,  xW,  ).  It 

h  h 

applying  Theorems  11.1  and 

Theorem  11.3.  Let  V,  and 
-  n 

pose  the  eigenfunctions  of 


,h)  - - {aN,h'uN,h}  ' 

remains  to  derive  error  estimates  by 

11.2. 

Wh  be  selected  as  in  (11.31).  Sup- 
(11.18)  belong  to  HP+2(0).  Then 


(11.32)  Uk  h  -  \kl  <  C(p)h2p+2 

and 

111.33)  "U  ■  Vo.i  S 


Proof .  We  begin  by  showing  that  all  of  the  hypotheses  of  Theorems 
11.1  and  11.2  are  satisfied.  We  have  already  noted  that  A  is 


symmetric  and  that  (11.1)  and  (11.2)  are  satisfied  for  the  problem 
(11.24) . 

The  source  problem  (11.6)  is  uniquely  solvable  for  each  g  e 
G  -  H°(Q).  r n  fuct  t-he  unique  solution  is  (a,u),  where 


-Au  =  g 


u  e  Hq(0 


a  =  V  u 

(cf.  the  discussion  in  1)  above).  To  see  that  (11.7)  is  uniquely 
solvable  for  each  g  e  g  it  is  sufficient  to  show  that  g  =  0 
implies  and  u^  are  zero.  Now  g  =  0  implies,  using  the 

second  equation  in  (11.7),  that  B(ah,v)  =0,  V  v  e  w^.  Setting 
y>  =  <*h  in  the  first  equation  and  using  this  fact  shows  that 
A(ah,ah)  =  0  which,  together  with  (11.2a),  shows  that  c?h  =  0. 
Then,  using  the  first  equation  in  (11.6)  again  we  get  B(ip.u^)  =  0, 

V  ip  e  v^.  For  our  specific  problem  this  is  J  u^  div  i p  dxdy  =  0, 

V  *>  e  Vh-  It  is  shown  in  Raviart  and  Thomas  [1977,  Theorem  4] 

that  corresponding  to  any  u^  e  there  is  a  ip  e  such  that 

div  =  u^.  Using  this  ip  we  thus  have  J  |uh|2dxdy  =  0  which 

implies  u^  =  0 . 

It  remains  to  check  (11.9).  Falk  and  Osborn  [1980,  Section 


3(d)]  have  shown  that 


Tg  Th9li0,0 


(11.34) 


Ch  ||  Tg!l  2  Qf  for  p  >  1 
Ch  il Tg ii  g  Q,  for  p  =  0 


C  h  g  0,0'  f°r  P  ?  °' 


which  proves  (11.9). 

We  now  apply  Theorems  11.1  and  11.2.  From  Raviart  and  Thomas 
[1977,  Theorem  5]  we  have 


V 

v.: 


|S'Sh’91'  °,o,'5  !IIS'Sh,g6H(div,0)  s  ChP+I(“Tg»p«,Q  +  ll9"p+l,0 

IH  ( w  ) 


and 


<T-Thls»0,0  5  ChP+1(IIT9»p+2,Cl  +  ll9llp+l,Q»- 


If 

g  e 

(11. 

18) 

« . 

Thus 

(11. 

,35a) 

(11. 

.  35b) 

and 

(11 

.  35c) 

(11 

.32) 

(11 

.  35  )  . 

(11 

.  35c) 

-1 

k 


-1 
k  ! 


P+2  ,ft 


(S-Sh}lJ!  o  o 

M  H  (ft  )  ,HU(0  ) 


,  ChP+1 


(S-S  )  I  _(|  <  Ch 

M  H  (ft ) ,H(div,G) 


P+1 


(T-T  )  |  I! 

M  H  (ft ) ,H  (ft ) 


<  Chp+1 


Remark  11.3.  Theorems  11.1  and  11.2  estimate  the  errors  in  mixed 
method  approximation  of  eigenpairs  in  terms  of  error  estimates  for 
the  corresponding  source  problems.  For  our  problem,  these  were 
mainly  provided  by  the  results  of  Raviart  and  Thomas  [1977). 

Note,  however,  that  estimate  (11.34)  -  the  estimate  that  ensures 


the  approximate  eigenvalues  converge  -  is  not  proved  in  Raviart 
and  Thomas  [1977]. 


given  in  (11.31) 


3)  Solution  of  Matrix  Eigenvalue  Problem 

The  matrix  problem  (11.4)  with  Vh  and 
is  large  and  sparse,  but  is  not  positive  definite. 


C .  A  Mixed  Method  for  the  Vibrating  Plate 
The  eigenvalue  problem 


(11.36) 


2 

A  u  =  \u,  in  Q 

u  =  In  =  °'  ln  ° 


arises  in  connection  with  the  small,  transverse  vibration  of  a 
clamped  plate.  A  commonly  used  variational  formulation  of  (11.36) 
is 


(11.37) 


u  e  Hq  (Q  )  ,  u  *  0 

Au  Av  dxdy  =  \  uv  dxdy,  V  v  e  H^(Q)  . 

Q 


A  finite  element  method  based  on  (11.37)  would  require  trial  and 

2 

test  space  that  were  subspaces  of  HQ(ft),  and  this  would  require 
(^-elements,  i.e.,  piecewise  polynomials  that  are  C1  across 
inter-element  boundaries.  In  order  to  avoid  this  requirement  we 
will  use  a  different  variational  formulation  for  (11.37),  one  that 
permits  the  use  of  C^-elements.  We  do,  however,  use  (11.37)  to 
show  that  (11.36)  has  a  sequence  of  eigenvalues 

!^<*ls*2£  •  •  •  S'  * 

and  corresponding  eigenfunctions 


u. 


which  can  be  chosen  so  that 


A  u  .  A  u  .  dxdv  =  *  .  u  .  u  .  dxdy  =  <c>  .  . 

i  J  J  l  3  ij 

JQ  Jft 


1 )  Variational  Formulation 

Introduce  the  auxiliary  variable  o  =  -Au.  Then  (11.36)  can 
be  written  as  a  second  order  system: 


a  +  Au  =  0,  in  Q 
-Aa  =  Xu,  in  0 
u  =  =  o  on  f  . 


Multiplying  the  first  equation  by  ip,  the  second  by  v,  inte¬ 
grating  over  Q,  and  integrating  by  parts  leads  to 


oip  dxdy  +  Auv  dxdy 


v>  ds 


oip  dxdy  -  Vu •?¥>  + 


a  ip  dxdy  -  Vu-Vip  dxdy,  V  ip  e  H1  (0  ) 

■  f\  r\ 


\  uv  dxdy  =  -  Aav  dxdy 


=  Va-Vv  dxdy  -  ^ 

Jr 


v  ds 


? (T •  7 v  dxdy,  V  v  *  HQ(U) 


Thus  we  arrive  at  the  variational  formulation 
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V 


(cr,u)  e  H1(Q)xhJ(Q),  (a,  u)  *  (0,0) 


(11.38) 


oy>  dxdy 


V u dxdy  =  0 ,  V  y>  e  H  (0) 


Va*?v  dxdy  =  -A  uv  dxdy,  V  v  e  HQ (0 ) 


We  derived  (11.38)  formally  from  (11.36).  One  can,  however, 
easily  make  the  argument  rigorous  with  the  aid  of  a  well-known 
regularity  result:  If  w  is  the  solution  to 


A  w  =  f ,  in  0 
d  w 


w  =  on  r  , 


0  3 

where  0  is  a  convex  polygon  and  f  ~  H  (0)  ,  then  w  •:  H  (0) 


and  'i  w1'  3  g  <  C 11  f 11  q  q.  cf.  Grisvard  [1985]  and  Kellogg  and  Osborn 
[1975].  We  assume  0  is  a  convex  polygon  in  the  remainder  of  this 
subsection.  Using  this  result  we  can  show  that  if  (»,u)  is  an 
eigenpair  of  (11.36)  and  f  -  -Au,  then  i  »,(<■»,  u  )  )  is  an  eigen- 
pair  of  (11.3b)  .  and  if  (*,*<’,  u  I  1  is  an  eigenpair  of  (11.38), 
t  hen  *  . u )  is  an  e ’ genpa i r  cf  (11.36)  and  <»  =  -\u.  (11.36)  has 

the  form  :  •  11.3'  w  ;  ♦ h 


V  M  ‘ 


W  H 


\*  a’  • 


l-“  ■/  V*  *”•  •*.  ^ < 


;  v  svv 
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UNCLASSIFIED 


F/G  12/1 


B  ( »/>  ,  u )  =  7  i/>  •  7  u  dxdy . 

0 

It  is  easily  seen  that  A  is  symmetric  and  that  (11.1)  and  (11.2) 
are  satisfied. 

(11.38)  has  eigenvalues 

o<\1<\2<  . . .  y<x> 

and  corresponding  eigenfunctions 

(o1,n1),  2 • u2 )»...» 

with  Oj  =  -Au^. 

1')  Regularity  of  the  Eigenfunctions 

If  (o,u)  is  an  eigenfunction  of  (11.38)  then,  as  we  have 
seen  above,  u  is  an  eigenfunction  of  (11.36)  and  a  =  -Au,  and 
hence  the  regularity  of  (a, u)  can  be  inferred  from  the  regula¬ 
rity  properties  of  (11.36).  For  results  on  this  later  regularity 
question  we  refer  to  Grisvard  [1985)  and  Kellogg  and  Osborn  [1975]. 
2)  Discretization  of  (11.38)  and  Assessment  of  the  Accuracy  of 
the  Approximate  Eigenpairs 

As  in  Subsection  B.  above,  our  discretization  will  be  via 
(11.4).  For  our  specific  problem,  a  mixed  method  for  the  associ¬ 
ated  source  problem  has  been  studied  by  Glowinski  [1973],  Ciarlet 
and  Raviart  [1974],  Mercier  [1974],  and  Falk  and  Osborn  [1980]. 

We  will  use  the  same  trial  and  test  spaces  employed  in  those 
papers . 

Let  f  =  { r )  =  (r^)  be  a  family  of  o-regular,  q-quasiuniform 


triangulations  of  0.  Then  for  p  =  2,3 . 


let 


(11.39b) 


Wh  =  so(Th)  n  Ho{Q)- 


We  then  consider  (11.4)  with  these  choices.  We  will  have  approxi¬ 
mate  eigenvalues  and  eigenfunctions 


\ ,  .  s  ...  <  \  . 

1  ,h  N,h 


and 


(ai,h,Ul(h)  "  *  "  (°N,h'UN,h) ' 


where  N  =  dim  (V*W,  ). 

h  h 

Theorem  11.4.  Let  and  W^  be  as  in  ( 1 1 . 39 )  with  p  >  2  and 

suppose  the  eigenfunctions  of  (11.36)  belong  to  HP+1(Q).  Then 

(11.40)  |\kh  -  Xkl  <  C(p)h2p_2 
and 

(11.41)  II «k,h  -  Vo.o  *  0(P)hP- 

Proof .  The  symmetry  of  A  and  the  validity  of  (11.1)  and  (11.2) 
for  problem  (11.38)  have  already  been  noted. 

The  source  problem  (11.6)  is  uniquely  solvable  for  each  g  e 
G  =  H°(0).  The  unique  solution  is  (<r,u),  where 

A  2u  =  g 

« 

U  e  Hq ( 0 ) 


(cf.  the  derivation  of  (11.38)).  The  unique  solvability  of  (11.7) 
is  easily  checked. 
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Falk  and  Osborn  [1980,  Section  3a]  have  shown  that 


«*»  -  Th3l!0.0  5  Ch2|!T9!!3,0- 

This,  together  with  the  regularity  result  mentioned  above,  gives 

ll(T  -  V*'o.O  s  Ch2||9i!0,0' 

which  proves  (11.9) 

Thus,  all  of  the  hypotheses  for  Theorems  11.1  and  11.2  have 
been  verified  for  the  problem  under  consideration.  Using  the 
results  in  Falk  and  Osborn  [1980,  Section  3a],  we  have 


ll(S-Sh»9»o,0  s  ChP_1 1'  Ta#p+1,0- 

»<S-Sh>9»l,0  s  0hP'2“T9»ptl,0. 

II|T-V9»0,0  S  ChP|IT9Vl,0- 

and 

»'T-Th)3"1.0  5  ChP“T«Vl,0' 

from  which  we 

obtain 

( 11 . 42a) 

II  <S-Sh>  I  _ii  *  ChP-1 

M  H  (Q ) ,H  (0  ) 

(11.42b) 

It  (s-sv)  |  II  o  ,  S  Chp“2, 

M  H  (Q  )  ,H  (Q  ) 

(11.42c) 

MT-Th)l-!I  o  0  s  chP' 

M  H  (0)  ,H  (0) 

and 

( 1 1 . 42d) 

II  (T-T-  )  1  :!  n  .  S  Chp. 

M  H  (0 ) , Hq (Q ) 

(11.40)  follows  immediately  from  Theorem  11.1  and  (11.42),  and 

(11.41)  follows  from  Theorem  11.2  and  (11.42c). 
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Remark  11.4.  The  estimates  obtained  in  this  subsection  were  first 
obtained  by  Canuto  [1978].  We  note,  however,  that  the  estimation 
techniques  used  here  will  yield  an  improvement  over  the  estimates 
of  Canuto  in  the  case  when  the  eigenfunctions  have  low  regularity. 
Our  method  of  proof  does  not  yield  any  estimates  for  p  -  1.  For 
this  case,  see  Ishihara  [1978  a,b]. 

3)  Solution  of  Matrix  Eigenvalue  Problem 

See  subsection  A. 3)  above. 

For  further  results  in  eigenvalue  approximation  by  mixed 
methods,  and  also  by  hybrid  methods,  we  refer  to  Mercier,  Osborn, 
Rappaz,  and  Raviart  [1981],  Mercier  and  Rappaz  [1978],  and 
Ishihara  [1977]. 

Remark  11.5.  We  have  seen  in  this  Section  and  in  Section  11  that 
there  are  various  methods  available  for  the  approximate  calcula¬ 
tion  of  the  eigenvalues  of  a  specific  problem.  For  example,  we 
have  analyzed  two  methods  for  the  membrane  problem.  Furthermore, 
this  discussion,  together  with  that  in  Section  3,  shows  that  there 
are  many  more  possibilities.  Clearly  the  rational  choice  of  a 
method  for  any  particular  concrete  problem  is  important.  The 
effective  choice  of  a  method  is  complex,  depending  on  many  aspects 
of  the  underlying  problem. 

D .  A  Mixed  Method  for  a  Problem  in  One  Dimension  with  Rough  Coef¬ 
ficients 


Consider  the  problem 


This  is  a  special  case  of  the  problem  (1.8),  (1.9a)  discussed  in 
Section  1.  We  will  be  especially  interested  here  in  the  case  in 
which  the  coefficients  a(x)  and  b(x)  are  rough  functions.  Such 
problems  arise  in  the  analysis  of  the  vibrations  of  structures  with 
rapidly  varying  material  properties,  of  composite  materials,  for 
example.  In  Section  3,  we  gave  (11.43)  the  mixed  formulation 
(3.26)  (or  (3.27)).  In  this  subsection  we  analyze  a  mixed  method 
based  on  (3.27). 

Hence  we  consider  the  problem 


(11.44) 


(o  ,u) 


L2(0,1)*HJ(0,1) , (a,u) 


(0,0) 


1 

u  'ip  dx  =  0,  V  ip  e  L  ( 0 , 1  ) 

0 


1 

av'  dx 

0 


-A 


buv  dx,  V  v  e 


hJ(0. 


1) 


where  a(x)  is  of  bounded  variation  and  b(x)  is  measurable 
0  <  aQ  *  a(x)  <  a1 ,  0  <  bQ  <  b(x)  s  b  . 

(11.44)  is  of  the  form  (11.3)  with 

V  =  H  =  L2(0, 1) , 

W  =  Hj(O.l), 


G  =  L2 (0, 1 )  ,  with  (u, v)Q 


1 

buv  dx, 

0 


dx , 


and 
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•v'ZV 


s's; 
■  V  • 


A(<t  ,  p  ) 


eigenvalue  of  T.  Thus  one  eigenvalue  * 
\ .  By  Theorem  7.2  we  have 


of  (11.4)  converges  to 


(11.46)  \\  -  Xh\  <  C{  |  (  (T-Th)u,u)Gl  +  !!  (T-Th)u(.g)  , 

where  u  is  any  eigenfunction  of  T  corresponding  to  *  1  with 
liuli_  =  1.  We  now  proceed  to  analyze  ((T-T^),u)Q. 

From  (11.6)  we  have 


( (T-Th)u,u)G 
(11.47) 


1 

bu(T-T.  )u  dx 
n 

0 

-B(Su,  (T-T.)u) 
n 

A(Su, (S-Sh)u)  +  B( ( (S-Sh)u,Tu) 


B ( Su , (T-Th)u) 


and  from  (11.6)  and  (11.7)  we  get 

(11.48)  0  =  A((S-Sh)u.S)  +  B(  (S-Sh)u,rj)  +  B  ( K  ,  (  T-Th)  u )  , 

V  r/  e  Wh,  <  e  Vh. 

Combining  (11.47)  and  (11.48)  we  get 


(<T-Th)u.u)G  =  A(  (S-Sh)u,Su+U  +  B(  (S-Sh)u,Tu+r/) 

+  B(?-Su, (T-Th)u) , 

V  r,  €  Wh,  ?  e  Vh. 


which,  letting 

(11.49) 


j?  =  -T^u  and  Z  =  Shu,  yields 
((T-Th)u,u)G  =  A( (S-Sh)u, (S+Sh)u) 

=  2 A ( (S-Sh)u,Su)-A( (S-Sh)u, (S-Sh)u) . 


Now,  again  using  (11.6)  and  (11.7)  we  get 

a(  (S-S.)u.Su)  =  -B(  (S-S.  )  u  ,  Tu  ) 
h  h 

=  -B (  (S-SL  )u,Tu  -  Y\  (Tu)  )  . 


where  is  t^ie  W^-interpolant  of  Tu,  and  hence,  using 


1 

/ 

S^u[Tu  -  £hTu]  dx  =  0 

0 

and  (11.6)  we  have 

(11.50)  A( (S-Sh)u, Su)  =  -B ( Su, Tu  -  £hTu) 


bu[Tu  -  ^Tujdx, 


Finally,  combining  (11.49)  and  (11.50)  we  get 

1 
f 

(11.51) 


((T-Th)u,u)Q  =  2 


bu[Tu  -  £hTu]dx 

0 

,1 


I (S-Sh)u| 


dx 


=  2\ 


-1 


bu(u  -  £.u)dx 


2| (S-Sh)u|2 


dx , 


Now,  using  (11.50)  and  (11.46)  we  get 

1 


IX  -  \h(  <  C{| 


bu( u  -  £hu)dx|  +  Sq1!!  ( S-Sh)ui  L 


(11.52) 


+  II  (T-T.  )u,lf  ii  )  . 
h  L2 


It  remains  to  estimate  the  three  terms  on  the  right  side  of 
(11.52). 

Recall  that  £^u  is  the  W^-interpolant  of  u.  By  a 
of  Prosdorf  and  ‘Schmidt  [1981]  we  know  that 


resul t 


(11.53)  !!u  -  V  uii  s  ch  V  ( u' ) , 

*-*h  _  1  0 

L 

where  V^(u')  denotes  the  variation  of  u' .  Recall  that  u  is 
an  eigenfunction  of  (11.43)  with  !|vii|Q  =  1.  Since  a(x)  is  of 
bounded  variation,  u'  will  be  of  bounded  variation;  in  fact 

(11.54)  vj(u')  *  c' 

where  C  =  C(aQ,a1  .bQ,b1  ,V*(a)  .A )  depends  on  aQ  ,  a  ,  bQ  ,  b  ,  V*  ( a )  , 
and  A .  Also 

(11.55)  Hull  <  C. 

L 

Using  Holders  inequality,  together  with  (11.53)  -  (11.55),  we  get 


(11.56) 


1 

I  bu(u  -  £hu)dx| 
0 


<  I! bull  Hu  -  Ehutl  j 

L  Ll 

s  Ch2V ( a ) , 


where  C  =  C ( aQ , a^ , b^ , b^ , V* ( a) , A ) . 

Next  we  consider  il(S-S,  )  u||  _  and  II  ( T-T.  )u,;r  .  It  is  easily 

h  L2  h  L2 

seen  that  the  results  in  Falk  and  Osborn  [1980]  imply 


(11.57)  !!(S-Sh)u|!L  ,  !!(T-Th)u!!L  <  c  ( aQ  ,  ax  ,  bQ  ,  bx  ,  l  )  h . 

Note  that  (11.57)  shows  that  i!T-T.li__ — >0 . 

h  GG 

Finally,  combining  (11.52),  (11.56),  and  (11.57)  we  have 

Theorem  11.5.  Suppose  X  is  an  eigenvalue  of  (11.43)  (or  of 
(11.44))  and  let  A^  be  an  the  approximate  eigenvalue  defined  by 
11.4  with  Vh  and  defined  by  (11.45).  Then 

I A  -  A.  |  s  C ( an , a .  ,  b  ,  b. , V* ( a ) ,  A  )h2. 


4 


•:y 


(11.58) 


The  striking  feature  of  estimate  (11.58) 


that  the  constant 


C  depends  on  the  bounds  a0'ai,bo'  and  bi  and  on  Vg(a)<  but 
is  otherwise  independent  of  a(x)  and  b(x) .  This  shows  that  the 
approximation  method  is  effective  for  problems  with  rough  coeffi¬ 
cients  (cf.  discussion  of  alternate  variational  formulations  at  the 
end  of  Section  3).  In  fact,  the  rate  of  convergence  indicated  by 
(11.58)  is  the  same  as  that  for  the  usual  Ritz  method  for  problems 
with  smooth  coefficients.  (11.58)  was  proved  by  Banerjee  [1987]. 
The  use  of  mixed  methods  for  eigenvalue  approximation  for  problems 
with  rough  coefficients  was  first  suggested  by  Nemat-Nasser  [1972a, 
1972b,  1974].  Rate  of  convergence  estimates  for  several  such  mixed 
method  were  derived  by  Babuska  and  Osborn  [1978]. 


Remark  11.6.  It  is  of  interest  to  note  that  the  variable  rr  can 
-  h 

be  eliminated  from  11.4  in  the  present  context  (i.e.,  with  the 

choices  for  V, W, H , G , A, B , V.  ,  and  W,  we  have  made  in  this  sub- 

h  h 

section)  leading  to  the  problem 


( 11.59; 


uh 6  wh 
M 


I 

i  =  l 


a  u'v'dx  =  A  . 
r  h  h 


T. 

l 


bu,  v  dx ,  V  v  e  W,  , 
h  h 


where  ar  is  a  step  function  with  ar 
1 , . . . , M ( r ) .  Thus  (11.59)  differs  from 


the 


[_diam 

usual 


dx 

a 


Ritz 


-1 

,  i  = 
method 


only 


in  that  the  coefficient  enters  the  calculation  through  its  harmo¬ 
nic  averages  over  the  subintervals  of  the  mesh  instead  of  through 
its  averages. 


Section  12.  Methods  Based  on  One  Parameter  Families  of  Varia¬ 
tional  Formulations 

In  our  treatment  of  the  membrane  problem  in  Subsection  10. B., 
the  trial  and  test  functions  satisfied  the  essential  boundary  con¬ 
dition  u  =  0  (cf.  (10.42)).  In  fact,  if  one  bases  the  approxima¬ 
tion  method  on  the  usual  variational  formulation  (10.39),  one  must 
impose  the  boundary  condition  on  the  trial  and  test  functions. 

To  avoid  this,  methods  have  been  developed  that  use  test  and  trial 
functions  that  are  not  required  to  satisfy  essential  boundary  con¬ 
ditions.  (See  the  discussion  of  essential  and  natural  boundary 
conditions  in  Section  3.)  In  this  section  we  discuss  two  such 
methods.  They  are  both  based  on  one  parameter  family  of  variation¬ 
al  formulations.  We  will  be  rather  brief  and  will  not  explicitly 
discuss  each  of  the  steps  1),  1'),  2),  and  3)  of  finite  element 
approximation  outlined  in  Section  10. 


A .  The  Least  Squares  Method 

Consider,  as  in  Subsections  10. B.  and  12.B.,  the  membrane 


problem 

(12.1) 


-Au  =  Xu  in  0 
u  =  0  on  r  =  dQ , 


where  Q  is  a  bounded,  open  set  with  boundary  r,  which,  for  the 
sake  of  simplicity,  we  assume  to  be  of  class  C® .  Note  that  we  are 
not  assuming  0  to  be  a  polygon.  (12.1)  has  eigenvalues 


and  eigenfunctions 


0  <  \1  <-  *  2  «  ••./"«> 


u . 


u 


2  ' 
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We  begin  by  introducing  the  least  squares  method  for  the  cor¬ 


responding  source  problem. 


(12.2) 


'-Aw  =  f  in  0 
I  w  =  0  on  r  , 


which  is  usually  given  the  variational  formulation  (see  Remark  12.1 
for  the  reason  for  using  complex  functions  here), 


(12.3) 


W  e  Hj(0) 


VwV v  dxdy  = 
Q 


fv  dxdy,  V  v  e  HQ(Q) . 


0 


We  now  give  (12.2)  a  different  variational  formulation.  w  solves 
(12.2)  if  and  only  if 


(12.4)  \ 


w  e  H  (0) 


AwAv  dxdy  +  p 


wv  dxdy  =  - 


fAv  dxdy. 


V  v  e  H  (0  )  ,  V  0  <  h  s  1  , 


where  p  =  p^  >  l  is  a  parameter  that  approaches  •*>  as  h — >0 . 

To  pass  from  (12.2)  to  (12.4)  is  immediate.  To  go  from  (12.4)  to 

2 

(12.2)  we  proceed  as  follows.  First  take  v  e  H  (0)  to  satisfy 

fA  v  =  A w  +  f  in  Q 
v  =  0  on  f  . 


This  chioce  for  v  in  (12.4)  yields  -Aw  =  f  in  Q .  The  equa¬ 
tion  in  (12.4)  then  becomes 

p  wv  dx  =  0 ,  V  v  •“  H2(Q), 

0 

which  implies  w  =  0  on  r .  In  (12.4)  the  boundary  conditions 


w  =  0  is  not  explicitly  imposed.  This  is  the  major  advantage  of 
the  formulation  (12.4)  over  (12.3)  for  our  purposes.  We  note  that 
w  can  also  be  characterized  by  an  extremal  property:  the  solu¬ 
tion  w  of  (12.4)  is  the  unique  minimizer  of  the  functional 


I -A v  -  f|2  dxdy  +  p  |v|2ds 


over  v  e  H  (Q ) . 

In  order  to  discretize  (12.4)  we  suppose  we  have  a  family  y 
=  (r)  =  (Th)  of  triangulations  of  (T-,  where  Q'  is  some  fixed 
rectangle  containing  Q.  Then  let 

Sh  =  Sp,2(rh)  =  (u  e  H2 (O' )  :  ujT  =  a  polynomial  of 

dog ree  p ,  V  T  e 

and  let  consist  of  the  restrictions  of  functions  in  Sp,2(rh) 

to  0.  The  family  satisfies  the  following  approximation 

result:  If  p  >  5,  then 

2 

(12.5)  inf  ^  h^||  v-.r  il  Q  <  Chti|vilt  for  2  <  t  <  p+1. 

*eSh  j=0 

See  Ciarlet  [1978]  for  a  proof  of  (12.5).  Then  we  define  an 
approximate  solution  w^  to  w  by  letting  w^  be  the  unique 
solution  to 


(12.6) 


wh  e  sh 


Aw.Av  dxdy  +  p  w,  v  ds  =  -  f  Av  dxdy,  V  v  ^  S.  . 


w^  is  called  the  least  squares  approximation  to  w  since  it  can 
be  alternately  characterized  as  the  unique  minimizer  of 


over  v  e  s 


I -4v  -  f |  dxdy  +  p l l v |  ds 
Q  * 


Bramble  and  Schatz  [1970]  proposed  and  analyzed 

-3  _  3 

this  method  for  p  =  p.  =  h  .  The  also  showed  p  =  h  to  be 

h 

the  optimal  choice  for  p. 

Now  we  return  to  the  eigenvalue  problem  (12.1).  Proceeding 
in  a  similar  way  we  see  that  (12.1)  has  the  variational  formula¬ 
tion 


(12.7)  \ 


u  e  H  (Q) 

A  u  Av  dxdy  +  p 
Q 


uv  ds  =  -A 


uAv  dxdy,  V  v  e  H  (Q ) 


O 


(12.7)  is  then  discretized  by 


A ,  complex,  0  *  u,  e  S. 


(12.8) 


Au^Av  dxdy  +  p 


Q 


uv  ds  =  -A, 


u^Av  dxdy,  V  v  e  . 


Q 


(12.8)  has  eigenpairs  ( A  j  h'uj  ^)  '  i  =  where  N  =  dim  Sh> 

If  for  f  e  H°(Q)  we  define  Tf  =  w  and  T.f  =  w,  ,  where 

h  h 

w  and  w^  are  defined  by  (12.2)  (or  (12.4))  and  (12.6),  respec¬ 
tively,  then  we  easily  see  that  (A,u)  is  an  eigenpair  of  (12.1) 
if  and  only  if  (p  =  A  1,u)  is  an  eigenpair  of  T  and 
is  an  eigenpair  of  (12.8)  if  and  only  if  (p^  =  A^.u^)  is  an 
eigenpair  of  T^.  We  will  estimate  the  error  in  (^h'uh^'  and 
thus  in  (*h'uh^'  bY  applying  the  results  in  Section  7.  T  and 
Th  are  clearly  compact  on  H°(Q).  We  will  show  HT-T^n — >0  in 
the  next  paragraph. 

In  order  to  apply  Theorem  7.3  on  H°(Oj  we  need  estimates  for 


where  u  is  an 


((T-T.)u,u),  !!(T-T,  )u!'  and  !i(T-T,  )u"  „ 

h  h  0,0  h  0 ,  0 

eigenfunction  of  (12.1)  corresponding  to  the  eigenvalue  *  (or  ») 
we  are  approximating.  These  estimates  are  all  contained  in  Bramble 
and  Schatz  [1970]  (and  also  in  Baker  [1973])  for  the  choice  p  = 
h  .  In  their  Corollary  4.1  take  y  =  3/2,  \  =  t-2,  g  =  0,  Z  = 

-s,  and  r  =  p+1  to  get 


(12.9) 


l(<T-Th)».*)0,0l  *  Chs+tll*llt-2y»,|!s  Q.  for 

0  <  s  <  p-3,  2  <  t  <  p+1. 


Taking  s  =  0  and  t  =  2  in  (12.9)  shows  that  :|T-T.  ' — >0 . 

n 

take  s  =  p-3  and  t  =  p+1  to  obtain 

112.10,  l((T-thW.*)0>0l  «  Ch2P-2H«ll  p-i,Q!l^‘!p_3  ,0. 


Now 


take  s  =  0  and  t  =  p+1  to  obtain 


"<T-Th>^*»o,o'  s  chP+1|l«»p-i,Q«*"i0,0’ 


and  hence 
(12.11) 


<<T-V*>'!0,0  5  chP+1|l«»p-i,0' 


and  take  s  =  p-3  and  t  =  2  to  obtain 


l((T-Th)*.v)0>Q|  =  |  {*,  (T-Th*)?)0  ()|  <  ChP  1||^|U 


0,0  "  p-3 ,0 ' 


and  hence 

(12.12,  l"T-Th‘|»,ll0  0  S  ChP-1l,„llp_3i0 


Theorem  12.1.  Suppose  the  approximate  eigenpairs  (*  .  ,  ,u.  .  )  are 
-  j,h  j,h 

-  3 

defined  by  (12.8)  with  p  =  h  and  suppose  the  eigenfunctions  of 
(12.1)  belong  to  Hp_1(0).  Then 


and 

(12-14)  l!uk,h  -  Vo.o  s  ch^- 


Proof .  Let  X k  be  any  eigenvalue  of  (12.1)  and  suppose  its  geo¬ 
metric  multiplicity  is  q,  i.e.,  the  geometric  multiplicity  of 
/uk  =  X^1  is  q.  Since  T  is  selfadjoint,  the  ascent  is  one  and 
the  algebraic  multiplicity  of  is  also  q.  q  of  the  X  .  ^ 

will  converge  to  X^.  Let  X^  h  be  one  of  them.  Theorem  7.3  can 
now  be  applied  and  (12.13)  follows  directly  from  (7.15)  and  (12.10) 
-  (12.12)  since  all  of  the  eigenfunctions  of  (12.1)  corresponding 
to  X k  belong  to  1(Q).  (12.14)  follows  from  Theorems  7.1  and 

7 . 4  and  (12.11).  a 

Remark  12.1.  Even  though  (12.1)  is  selfadjoint,  (12.8)  is  a  non- 
selfad joint  (finite  dimensinal)  problem.  Thus  one  needs  the  gene¬ 
ral  (not  necessarily  selfadjoint)  theory  in  Section  7  to  analyze 
the  least  squares  method.  The  nonselfad jointness  of  (12.8)  is  the 
reason  we  have  used  complex  function  spaces  in  this  analysis. 

B .  The  Penalty  Method 

We  will  once  more  consider  the  membrane  eigenvalue  problem 
(10.38)  and  assume  the  boundary  T  of  Q  is  of  class  C*  (cf. 
also  (12.1)).  In  Section  10  we  gave  this  problem  the  variational 
formulation 


(12.15) 


where 
( 12 . 16a) 


fu  e  Hj(0) 

[a  ( u ,  v )  =  X  b  ( u ,  v )  ,  V  v  e  H*(Q), 


a  (  u ,  v ) 


V  U • V  V  dxdy 

0 
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( 12  . 16b) 


b(u, v) 


uv  dxdy . 

Q 


Let  us  replace  the  boundary  condition  u  =  0  on  r  in 

(10.38)  by  u  +  ip  =  0,  i.e.,  let  us  consider  the  problem 

an 


(12.17) 


-Au  =  \u  in  0 

-1  3u  _  r 

u  +  tfi  rr—  =  0  on  r  , 
3n 


where  y>  -  tp^  >  l  is  a  parameter  that  approaches  -hx>  as  ti¬ 
lt  is  easily  seen  that  (12.17)  has  the  variational  form 


( 12.18) 

where 


ru  e  H  (0) 


j^(u,v)  =  Ab(u,v)  ,  V  v  e  H  (Q  )  , 


(12.19) 


av(u.v)  - 


Vu*7u  dxdy  + 


Q 


uv  ds, 


Q 


Note  that  in  (12.18),  in  contrast  to  (12.15),  we  have  not  imposed 
any  constraint  on  either  u  or  v.  This  is  the  case  since 
u +ip  =  0  is  a  natural  boundary  condition  (cf.  Section  3). 

We  now  estimate  the  error  between  the  eigenvalues  and  eigen¬ 
vectors  of  (12.15)  and  (12.18).  Toward  this  end  consider  the  cor 
responding  source  problems: 


(12.20) 

and 


(12.21) 


—A  Uq  =  f  in  0 
u  =  0  on  f 


-Au  =  f  in  Q 

-1  du  „ 

u  +  ip  =  0  on  T. 

v  an 


We  view  (12.21)  as  an  approximation  to  (12.20).  Denoting  by  u 


(respectively  u^ )  the  solution  of  (12.20)  (respectively  (12.21)) 
we  are  interested  in  estimating  u  -  un.  It  is  shown  in  Babuska 

If)  o 

and  Aziz  [1973,  Section  7.2]  that 

( 12.22)  =  uQ  -  +  r  , 

where  Z  is  the  solution  of  the  problem 

'-A*:  +  *  =  0  in  0 

(12.23)  Su 

K  -  in  0 

dn 

x 

and  t;  is  the  solution  to 

ft  ®  H1(Q) 

(12.24)  \  _j  i 

(a^R.v)  =  y>  a(? , v) ,  V  v  e  H  (Q  )  . 

From  (12.16),  (12.19),  and  (12.24)  we  have 

!K  I! 2 ,  i  a (<  <)  =  v~1{a(Z,K) 

H1  (0  )  9 

-  v~1\\Z\\  .  IK  II  . 

H  (Q)  H  (Q) 

and  hence 

(12.25)  IK  II  ,  s  ^_1|RI! 

H X(Q)  H  (Q) 

From  (12.23)  and  (12.25)  we  obtain 

(12.26)  ||u  -  u  ||  <  2v_1IKii  . 

V  H  (Q)  H1 (0 ) 

From  (12.23)  and  regularity  results  for  elliptic  boundary  value 
problems  we  get 

(12.27)  IK  II  .  s  Cilfil 

H  (Q)  H  (0) 


Combining  (12.26)  and  (12.27)  yields 


(12.28) 


"  V  1 

v  H  (0) 


s  Cv"  llfl! 


H°(Q) 


If  we  now  introduce  the  operators  T  and  on  H  ( Q )  by 

Tf  = 


and 


T  f  =  u  , 

y>  v 


then  (12.28)  implies  that 

(12.29)  IKT-T  )f|| 


o  5  °*‘1|i£iio,q- 
H  (Q) 


It  is  immediate  that  (1.  ,u)  is  an  eigenpair  of  (12.15)  if  an  only 

if  (n  =  X  1 ,u)  is  an  eigenpair  of  T;  likewise  (X^.u^)  is  an 

eigenpair  of  (12.18)  if  and  only  if  /<  =  (X  1,u  )  is  an  eigen- 

V  V  y> 

pair  of  T  .  It  thus  follows  immediately  from  Theorems  7.1  -  7.4 
and  (12.29)  that 

(12.30)  |  X  .  -  X  .  |  <  o”1 

J  V> ,  3 

and 

(12.31)  !!u.-u  II  s  Cip  1 , 

j  V>.3  Hi(ft) 

where  (Xu.)  and  (X  ,  u  .)  denote  the  eigenpairs  of 

J  J  vP  /  J  W  $  J 

(12.15)  and  (12.18),  respectively.  Note  that  (12.30)  and  (12.31) 
are  estimates  of  the  same  order  for  both  the  eigenvalue  and  eigen¬ 
vector  errors.  This  is  in  contrast  to  approximations  we  have  ana¬ 
lyzed  previously  in  this  article.  An  analysis  of  a  one-dimensional 
model  problem  shows  that,  for  the  type  of  approximations  we  are 
considering,  the  eigenvalue  and  eigenvector  error  is,  indeed,  of 
the  same  order. 

Next  we  consider  the  problem  (12.18)  and  approximate  it  by  a 
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finite  element  method,  letting  the  resulting  eigenpairs  be 
(*  *  K«u  .  .  ).  Since  u  and  v  in  (12.18)  are  taken  in 

H1(0),  we  need  not  impose  any  boundary  condition  on  the  trial  and 
test  space  Sh-  If  one  now  analyzes  the  error  in  the  finite  ele¬ 
ment  approximation  of  (12.18),  selects  v>  so  that  the  error  in 
passing  from  (12.15)  to  (12.18)  is  of  the  same  magnitude  of  that 
incurred  in  the  finite  element  approximation  of  (12.18),  and  then 
combines  the  error  estimates  (12.30)  and  (12.31)  with  those  for 
the  finit  element  approximations  of  (12.18),  one  obtains  esti¬ 
mates  for  the  difference  between  (A.,u.)  and  (A.  .  ,  ,u  .  .  ). 

j  J  ^  #  j  /  n  ^  /  j  *  n 

We  stress  that  the  (A  .  .  ,u  .  .  )  are  calculated  from  a  matrix 

v  ,  j  /  h  ip  ,  j  ,  h 

eigenvalue  problem  corresponding  to  trial  and  test  spaces  that  are 
not  required  to  satisfy  the  essential  boundary  condition  for  the 
membrane  problem  (12.1).  The  approximation  method  we  have  out¬ 
lined  is  referred  to  as  the  penalty  method. 

We  refer  the  reader  to  Babuska  and  Aziz  [1973,  Section  7.2] 
for  a  detailed  analysis  of  the  penalty  method  for  the  source  prob¬ 
lem.  Estimates  for  the  errors  in  eigenvalue  approximation  can  be 

easily  derived  from  the  corresponding  source  problem  estimates  by 

means  of  Theorems  7.1  -  7.4.  Because  this  application  of  these 
error  estimates  to  the  eigenvalue  problem  is  similar  to  those  dis¬ 
cussed  above  and  raises  no  new  issues,  we  will  not  give  a  formal 
statement  of  the  results. 

Remark  12.2.  If  0  is  a  polygon,  then  the  choice  ip  =  ®  corres¬ 
ponds  to  satisfying  the  boundary  condition  on  <50,  i.e.,  con¬ 

straining  S^(t)  to  be  Sq(t),  and  the  resulting  method  is  iden¬ 
tical  with  that  discussed  in  the  Subsection  10. B.  If  F  is  not 


polygonal,  then  »  =  oo  will  lead  to  the  constraint  Sp(r)  =  §J^(t), 
where  5P(r)  consists  of  those  u  e  SP(r)  which  are  zero  on  every 
triangle  which  intersects  r .  The  finite  element  solution  then 
solves  the  problem  on  0  instead  of  Q,  where  Q  consists  of 
the  union  of  all  triangles  which  do  not  intersect  T .  Sometimes 
the  mesh  is  constructed  so  that  0  -  0  is  as  small  as  possible  by 
interpolating  r  by  straight  lines. 

Remark  12.3.  In  practical  computation  (codes)  the  penalty  method 
(or  some  equivalent  method)  is  often  also  used  when  Q  is  a  poly- 

O 

gon  by  taking  ip  to  be  very  large  (say  ip  =  10  )  .  This  is  just 
a  way  of  imposing  the  essential  boundary  conditions  in  the  code. 

Remark  12.4.  The  least  squares  method  and  penalty  method  are 
seldom  used  as  a  way  to  treat  essential  boundary  conditions  on  a 
curved  boundary  because  of  the  difficulty  in  the  computation  of 
a^(u,v),  which  requires  area  integrations  over  triangles  which 
intersect  the  boundary.  The  usual  approach  is  to  use  curvilinear 
elements,  which  allow  exact  satisfaction  of  the  boundary  condition 
in  a  similar  way  as  when  the  domain  is  polygonal  (cf.  Remark  12.3). 

Let  us  end  this  section  by  noting  some  similarities  and  dif¬ 
ferences  in  the  least  squares  and  penalty  methods. 

•  Both  methods  circumvent  essential  boundary  conditions  by 
reformulating  the  original  problem  in  terms  of  a  one  para¬ 
meter  family  of  variational  formulations.  In  both  methods, 
the  optimal  value  of  the  parameter  depends  on  the  mesh, 

i .e. ,  on  h. 

•  With  the  least  squares  method,  the  optimal  value  of  the 

-3 

parameter  (p  =  h  )  is  independent  of  the  solution. 


This  is  related  to  the  fact  that  the  alternate  variational 
formulation  characterizes  the  solution  exactly  for  any 
value  of  the  parameter.  In  the  case  of  the  penalty 
method, the  optimal  value  of  the  parameter  depends  on  the 
mesh  and  the  smoothness  of  the  solution  or  the  eigenfunc¬ 
tion.  This  is  related  to  the  fact  that  the  exact  solution 
does  not  exactly  satisfy  the  one  parameter  family  of  for¬ 
mulations  for  any  value  of  the  parameter  y>  *  +<». 

The  least  squares  method  employs  (^-elements  (i.e.,  sub- 
2 

spaces  of  H  (Q)),  whereas  the  penalty  method  employs 
C^-elements  (i.e.,  subspaces  of  H*(Q)).  As  we  have  pre¬ 
viously  noted,  C°-elements  are  easier  to  construct  than 
(^-elements . 


Section  13.  Concluding  Remarks 

A.  We  have  illustrated  the  application  of  the  general  theory  that 
was  presented  in  Chapter  II  by  considering  several  important  model 
problems.  It  should  be  clear  from  the  analysis  of  these  model 
problems  how  to  treat  a  wide  variety  of  problems.  We  have  seen, 
however,  that  the  application  of  the  general  theory  to  a  concrete 
problem  may  require  subtle  analysis. 

B.  In  Sections  10,  11,  and  12  we  have  illustrated  the  main 
approach  to  finite  element  approximation  of  eigenvalue  problems. 

We  have  seen  that  there  are  many  available  methods  and  that  their 
basic  theoretical  properties  can  be  established  as  an  application 
of  the  results  in  Chapter  II.  Nevertheless,  the  implementation  of 
these  methods  raises  many  other  important  questions;  although  we 
cannot  address  these  questions  in  detail,  we  now  mention  some  of 
them . 

1)  Which  method  is  most  effective  for  a  specific  problem? 

What  is  the  goal  of  the  computation?  We  remark  that  sometimes  high 
accuracy  is  achieved  for  eigenvalue  approximation,  but  that  only 
low  accuracy  is  obtained  for  the  approximation  of  other  important 
quantities  such  as  the  stresses,  moments,  or  shear  forces. 

2)  What  types  of  meshes  or  adaptive  mesh  procedures  are 
desirable?  How  should  the  quality  of  the  computed  results  be 
assessed  a  posteriori?  For  a  survey  of  results  in  this  direction, 
see  Noor  and  Babuska  [1987], 

3)  Which  matrix  eigenvalue  solvers  should  be  used?  What 
computer  architecture  is  desirable  (sequential,  parallel)? 

These  questions  are,  of  course,  not  restricted  to  eigenvalue 


computation.  They  also  arise  with  finite  element  computation  of 
source  problems.  Some  of  these  questions  may  be  addressed  in 
other  articles  in  this  Handbook. 

C.  The  Ritz  method,  which  was  discussed  in  Section  10,  is  most 

easily  analyzed  with  the  results  of  Section  8,  specifically  with 
(8.44)  -  (8.46).  Note  that  because  of  (8.32),  (8.11)  is  satisfied 
with  !i  =  a  and  thus  the  major  requirement  on  is  that  it 

have  good  approximation  properties. 

D.  We  have  seen  in  Remark  11.2  that  mixed  methods  for  eigenvalue 

approximation  have  the  form  of  (8.10)  and  (8.14).  Thus,  if  a 
method  satisfies  the  hypotheses  of  Section  8,  specifically  (8.1), 
(8.2),  (8.3),  (8.6),  (8.11),  and  (8.13),  then  the  method  can  also 

be  analyzed  with  the  results  of  Section  8.  Most  mixed  methods, 
however,  fail  to  satisfy  at  least  one  of  these  hypotheses,  and  we 
thus  cannot  rely  on  the  results  of  Section  8.  We  now  comment  on 
two  of  the  examples  discussed  in  Section  11  in  regard  to  which 
results  in  Chapter  II  their  analysis  is  based  on. 

1)  Consider  first  the  mixed  method  discussed  in  Subsection 
11. A.  for  the  membrane  problem.  It  is  easily  seen  that  the  varia¬ 
tional  formulation  (11.24)  satisfies  (8.1),  (8.2),  and  (8.3),  but 

that  it  does  not  satisfy  (8.6).  In  Section  8,  assumption  (8.6)  is 
used  to  show  that  the  operator  T  defined  by  (8.8)  is  compact. 

For  our  example,  for  (f,g)  e  H ( div,Q ) * H° (0 ) , 

T ( f , g )  =  (u,u), 

where  u  solves 

fu  €  H \(Q) 

1 

^-Au  =  g  in  Q 


1  V  V 

w 

,v.v 


-* 


m 


r’v'v 


m 


m 


V 


-C- 

>;V. 


ft* 

a  •  to  ■  i 


V 


‘v  v 


•»  '  V 

\V 


•‘v\ 


•\.v 


A 


:  <■  : 
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/  v 


•••  -><1 
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and  17  =  VU,  and,  by  noting  in  particular  the  dependence  of 
on  g,  we  see  that  T  :  H {  di v ,0  )  *  H°  (0  ) — ►  H(  div.ft  )  -  H°  (Q  )  is  not 
compact.  Since  I'  is  not  compact,  T^,  as  defined  by  (8.16), 
cannot  converge  to  T  in  norm.  Because  of  these  facts,  the 
results  of  Section  7  do  not  apply  (to  this  T).  The  analysis  that 
we  used  for  this  problem  (cf.  Theorem  11.1)  is  based  on  Section  7 
and  circumvents  this  difficulty  by  using  a  different  operator, 
namely  T  :  H°(ft) — >H°(0)  defined  by  Tg  =  u  (cf.  11.8c). 

As  mentioned  in  Remark  7.7,  results  for  noncompact  operators 
which  parallel  those  in  Section  7  have  been  proved  by  Descloux, 
Nassif,  and  Rappaz  [1978a,  1978b],  and  one  can,  if  fact,  use  them 
to  derive  the  estimates  we  obtained  in  Subsection  11. B,  specifi¬ 
cally  (11.32)  and  (11.33).  We  will  not  present  the  details  of 
this  analysis  but  will  comment  briefly  on  the  applicability  of  the 
results  of  Descloux,  Nassif,  and  Rappaz  [1978a,  1978b]  to  our  prob 
lem . 

For  their  results,  T  is  not  required  to  be  compact  and  T^ 
is  assumed  to  converge  to  T  in  the  sense  that 

(13.1) 

inf  il  (a  ,  u )  -  (  r  ,tj  )  ;|  „ — *0  for  each  (a,u)  e  H  ( div  ,0  )  •  H°  (0 

(  t,r?  )GSh=vh'  Wh 

and 

(13.2) 

i'T.-Tii  =  sup  !!  (T  -T)  (  f  ,g)j|  -  >0, 

(  f,g)eVh'Wh  H(  div  ,0  )  •  H  (0  ) 

11  (f  ,g)!' 

H  (  d  i  v  ,  0  )  ■  H  ( Q  )  .  „ 

as  h — *  0  . 

With  Vh  and  Wh  defined  as  in  (11.31),  (13.1)  follows  from  the 


approximation  result  in  Raviart  and  Thomas  [1977]. 

We  now  verify  (13.2),  which  is  central  point  in  any  applica 
tion  of  the  results  of  Descloux,  Nassif,  and  Rappaz  [1978a, 
1987b].  For  (f,g)  e  Vh*Wh'  let  (a<u)  =  T(f-9)  and  <ah'uh* 
Th(f,g),  where  T  and  are  defined  by  (8.8)  and  (8.16), 

respectively,  for  the  problem  discussed  in  Subsection  11. B.  We 


know  that  u  e  H1(0)  , 

-A  U  = 

g,  and  a  =  Vu, 

and  hence  div  a  = 

g.  Also,  if  g  e  Wh 

it  is 

easily  seen  that 

div  a ^  =  -g.  Thus 

!!  (T  -T)  (  f  ,g)  '1 

n  H  (  di  v  ,0  ) 

vH°(0) 

=  !!  (ah,uh)  -  (a  ,u) !! 

H ( div ,0 ) vH°(0 ) 

r 

-  !!div  a,  - div  a‘| 
h 

+  5a^\2°(o, 

(13.3) 

+  ,u  -Ul.  2  l1 
'  h  0,0) 

'la  -a!!" 
h  W°(0) 


+  !luh-u"o,o 


1/2 


for  (  f  ,  g  )  Vh'Wh' 


From  the  results  in  Falk  and  Osborn  [1980]  we  have 


13.4a) 


'a,  -a'!  . 
h  iH  0  (0  ) 


Ch;|  u!! 


<  Ch"g;i 


2  ,0 


0,0 


and 


(13.4b) 


Uh_U"  0 , 0  5  Ch  ''  U"  2  ,Q 


s  Ch  !!  g!: 


0,0 


Combining  (13.3)  and  (13.4)  we  get 
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(Th-T)  (f  ,g)il 


H  ( div  ,0  )xH  (0) 


Chllgl 


0,0 


(13.5) 


Ch||  (  f  ,  g) 


H ( div ,0  )*H  (0  ) 


for  (f.g)  e  VhxWh. 


K  (13.2)  follows  directly  from  (13.5). 

.  2)  Consider  next  the  method  discussed  in  Subsection  ll.C  for 
!'  the  vibrating  plate  problem.  The  variational  formulation  (11.38) 

i,  for  the  problem  does  not  satisfy  (8.2)  and  (8.11).  Note  that  the 

»■ 

method  was  analyzed  by  means  of  Theorem  11.1  which  is  based  on 
Theorem  7 . 3 

Remark  13.1.  The  fact  that  many  mixed  approximation  methods  fail 

i  ______ 

-1  to  satisfy  the  usual  hypotheses  (cf.  Babuska  [  1971,  1973]  and 

Brezzi  [1974])  for  variational  approximation  methods  is  an  issue 
for  the  approximation  of  source  problems  as  well  as  eigenvalue 
‘  problems.  The  abstract  results  in  Falk  and  Osborn  [1980]  have  as 

their  main  application  the  analysis  of  mixed  methods  which  fail  to 
satisfy  the  usual  hypotheses  for  variational  approximation  methods 
|  In  this  connection  see  also  Babuska,  Osborn,  and  Pitkaranta  [1980] 

where  problem  (11.38)  is  reformulated  in  terms  of  alternate  spaces 
)  with  alternate  (mesh  dependent)  norms  so  as  to  satisfy  the  usual 

hypotheses . 
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implementation  of  numerical  analysis  and  related  topics,  with  emphasis 
on  the  numerical  treatment  of  linear  and  nonlinear  differential  equa¬ 
tions  and  problems  in  linear  and  nonlinear  algebra. 

o  To  help  bridge  gaps  between  computational  directions  in  engineering, 
physics,  etc.,  and  those  in  the  mathematical  community. 

o  To  provide  a  limited  consulting  service  in  all  areas  of  numerical 
mathematics  to  the  University  as  a  whole,  and  also  to  government 
agencies  and  industries  in  the  State  of  Maryland  and  the  Washington 
Metropolitan  area. 

o  To  assist  with  the  education  of  numerical  analysts,  especially  at  the 
postdoctoral  level,  in  conjunction  with  the  Interdisciplinary  Applied 
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Science  Departments.  This  includes  active  collaboration  with  govern¬ 
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